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Abstract 

A symmetric subset of the reals is one that remains invariant under some reflection 
X I— > c — X. Given < e < 1, there exists a real number A(e) with the following 
property: if < 6 < A(e), then every subset of [0,1] with measure e contains a 
symmetric subset with measure 6, while if 5 > A(e), then there exists a subset of [0, 1] 
with measure e that does not contain a symmetric subset with measure 5. In this 
paper we establish upper and lower bounds for A(e) of the same order of magnitude: 
for example, we prove that A(e) = 2e- 1 for < e < 1 and that O.Sge^ < A(e) < O.Se^ 
for < e < ii. 

This continuous problem is intimately connected with a corresponding discrete 
problem. A set S of integers is called a B*[g] set if for any given m there are at 
most g ordered pairs (si, S2) £ S x S with si + S2 = m; in the case g = 2, these are 
better known as Sidon sets. We also establish upper and lower bounds of the same 
order of magnitude for the maximal possible size of a B*[g] set contained in {1, . . . , n}, 
which we denote by R{g,n). For example, we prove that R(g,n) < LSl-^/^n for all 
n > g > 2, while R{g,n) > for sufficiently large integers g and n. 

These two problems are so interconnected that both continuous and discrete tools 
can be applied to each problem with surprising effectiveness. The harmonic analysis 
methods and inequalities among various norms we use to derive lower bounds for 
A(e) also provide uniform upper bounds for R(g,n), while the techniques from com- 
binatorial and probabilistic number theory that we employ to obtain constructions of 
large B*[g] sets yield strong upper bounds for A(e). 
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1 Introduction 



A set C C M is symmetric if there exists a number c (the center of C) such that c + x G C 
if and only if c — x G C. Given a set A C [0, 1) of positive measure, is there necessarily a 
symmetric subset C C A of positive measure? The answer turns out to be "yes", and the 
main topic of this paper is to determine how large, in terms of the Lebesgue measure of A, 
one may take the symmetric set C. In other words, for each e > we are interested in 

f^. every measurable subset of [0, 1) with measure el 

' ' contains a symmetric subset with measure 5 J ' 

It is not immediately obvious that A{6) > 0. 

We have dubbed this sort of question "continuous Ramsey theory", and we direct the 
reader to Section 2 for problems with a similar flavor; some of these have appeared in the 
literature and some are given here for the first time. 

We determine a lower bound for A{e) using tools and ideas from harmonic analysis, 
nonstandard analysis, and the theory of wavelets. We also construct sets without large 
symmetric subsets using results from the theory of finite fields and probabilistic number 
theory. These two lines of attack complement each other, and our bounds yield new results 
in additive number theory as well. 

Consider first the analogous discrete problem: given a set A C {1, 2, . . . , n}, how large is 
the largest symmetric subset of A7 There are ~ pairs of distinct elements of A (where 
l^l denotes the cardinality of A), and each pair (a, 6) has a center ^ which is among the 
~ 2n values {|, 2, |, 3, . . . , ^^y^}- By the pigeonhole principle, there is some c that is the 
center of at least ~ (2n) pairs of elements of A. The union of those pairs is a symmetric 

set, i.e., there is a symmetric set C C A with 

|gU2||Ap_im| 

n ^ n 2n 2 \ n 

In other words, the density of C is roughly at least one-half the square of the density of A. 

A Sidon set is a set A of integers with the property that distinct pairs of elements have 
distinct sums: if a,b,c,d & A and a + b = c + d, then {a, b} = {c, d}. This is equivalent to 
asserting that A has no symmetric subsets with more than 2 elements. It is known |^in38|| 
that there is a Sidon set A contained in {1, 2, . . . ,n} with |y4| ~ ^/n. Thus, if C C yl is a 
symmetric set, then 

\C\ 2 f\A\ 
n n \ n 

In other words, the density of C is roughly at most twice the square of the density of A. 

For the discrete version of the problem, at least, we see that one can guarantee a 
"quadratically large" symmetric subset, and that one cannot do better in general. 

1.1 Continuous Results 

Let A denote Lebesgue measure on M. We are led by analogy with the discrete problem to 
guess that every subset A C [0, 1) has a symmetric subset with measure ^A(A)^, and that 
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there are subsets A C [0, 1) that do not have symmetric subsets with measure larger than 
2A(A)2. That is, we are led to guess that < A(e) < 2e'^. 

We find the following equivalent definition of A{e) easier to work with than the definition 
given in Eq. (|lD: if we define 

D{A) := sup{A(C) : C C A, C is symmetric}, (2) 

then 

A(£) := inf{D(A): A C [0,1), X{A) = e}. (3) 

Write A{x) for the indicator function of a set A, and define the sumset A + A : = 
{ai + a2'- CLi E A}. We notice first that the maximal symmetric subset of A with center c is 
Ar\{2c—A), where 2c— A = {2c— a: a G A}; this intersection has measure J A{x)A{2c—x) dx, 
which can be written as the value of the convolution A*A{2c). This means that D{A) simply 
equals the supremum norm \\A* A\\oo- Since supp(A * A), the support of the function A* A, 
equals A + A (up to a set of measure zero) and is thus contained in [0, 2), we see that 

\\A*A\\ >^^^^^-^<^>\(A? 
1'"^ - A(supp(A * A)) - X{A + A)- 2^^^^ ' 

and hence A(£) > ^e'^ (as we had guessed from the analogy with the discrete case). This 
lower bound, which we shall call the trivial lower bound on A(e), is not so far from the best 
we can derive! In fact, the bulk of this paper is devoted to improving the constant in this 
lower bound from | to 0.591389. Moreover, we are able to establish a complementary upper 
bound for A{e) in a manner that we shall discuss in the next section. 

In addition to a heavy reliance on Fourier analysis, we make use of wavelets, albeit only 
in a rather naive manner. Although much of our argument (and indeed the initial problem 
of bounding A{e)) was initially motivated by nonstandard analysis, we do not make direct 
use of it here. 

Figure |l| shows the precise upper and lower bounds we obtain for A{e)/e'^ as functions 



of e, and Theorem |1.1| gives the highlights: 
Theorem 1.1. We have: 

I. A{e) = 2£ - 1 /or i| < £ < 1, and A{e) > 2e - 1 /or | < e < i|; 

II. A(5) > 0.591389e2 for all < e < 1; 

III. A{e) > 0.5546e2 + 0.088079£=^ for all < e < 1; 



IV. A{e) < §^e^ < 0.7934£2 /or < e < i|, 



V. Ais) < (j^^^ = js^ + Oie') for allO<e<l. 

Note that ^ < 0.7854. The upper bound in part (v) of the theorem is superior to the one 
in part (iv) in the range < e < ^{8\^6n — llvr) = 0.0201. The five parts of the theorem 



are proved separately in Proposition [7.1| , Proposition [4.12| , Proposition |4.16| , Corollary 
and Proposition [73| , respectively. 
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Figure 1: Upper and Lower Bounds for /\{e)/e^ 

Thus we have estabhshed that is the correct order of magnitude for the function A, 
and we have improved upon both the constants | and 2 which appeared in our heuristic. In 
the following subsection, we discuss how we might improve the corresponding constants for 
the discrete problem as well. 



1.2 Connecting the Continuous to the Discrete 



It turns out that the upper bounds given in Theorem |l.l| (iv)-(v) are derived from number- 
theoretic considerations. A set S of integers is called a B2 [g\ set if for any given m there are 
at most 2g ordered pairs (si, S2) G S* x S* with Si + S2 = m (equivalently, if the coefficients of 
(Snes-^") bounded by 2g). Sidon used -B2[l] sets, which are the Sidon sets mentioned 
earlier, as a tool in his study of Fourier series. It is perhaps fitting that we now use Fourier 
analysis as a tool in our study of B2[g\ sets. 

Many constructions of B2 [g] sets have appeared in recent years. Our constructions of sets 
of reals without large symmetric subsets are based on known constructions of B2 [g] sets. This 
geometric aspect of B2[g] sets has led us to generalize and optimize these constructions further 
and to give improved upper bounds on the density of B2[g\ sets contained in {1, 2, . . . , ri}. 

Given a B2[g\ set S" C {1, 2, . . . , n} and any integer m, the union of all pairs (si, S2) G 
S X S with si + S2 = m is the largest symmetric subset of S with center y, and all symmetric 
subsets of S arise in this way. From the definition of a B2[g] set, we see that S contains no 
symmetric subset with more than 2g integers. In light of this, it is not surprising (though 
it certainly requires proof — see Proposition |5.1|) that the largest symmetric subset of the set 
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of real numbers 

A(5):=Uf^,-)^[0,l) (4) 
^ L n n/ 

has measure at most ^. In other words, < D(A(S)) < ^. 

For technical reasons, it is more convenient for us to speak of B*[g] sets rather than B2[g] 
sets. A set S of integers is called a B* [g] set if for any given m there are at most g ordered 
pairs (si, S2) & S x S with Si + S2 = m. Note that the definitions of B2[g] sets and B*[2g] 
sets coincide, but we shall also consider B*[g] sets with g odd. We introduce the function 

R{g,n) ■.= max{\S\: S C {1,2, ... ,n}, S is a B*[g] set}. (5) 

Since there are 15"^ sums of pairs from S, while there are fewer than 2n possible sums 
each of which can be realized at most g times, we immediately deduce the upper bound 
R{g,'n) < -\/2(7n, which we shall call the trivial upper bound for R{g,n). 

The construction of ^(5*) indicated above thus gives the bound A ( '^^^'"^ ^ < S., and we 
can use this inequality to give an upper bound on R{g, n). The trivial lower bound on A(£:) 
gives |(-^^%^) < A ( ^^^''^^ ^ < ^, whence R{g,n) < ^2gn. Thus, the trivial lower bound on 
A(£:) implies the trivial upper bound on R{g,n), and any improvement we can make on the 
trivial lower bound for A{e) will immediately provide stronger upper bounds for R{g,n). In 
the same way, we shall use lower bounds on R{g,n) to derive upper bounds on A{e). All 
this is made rigorous in Sections O and 0. 

In fact much more than A ( ^'^'"^ j — f true. Proposition ^.3| below states that 

A{e) = mf{f^: n> g >1, R{g, n) > ne}. 

Thus a sufficient understanding of the dependence of R{g,n) on g and n would completely 
solve our continuous problem. Unfortunately, this understanding is still somewhat lacking. 



1.3 Discrete Results 



The true size of Sidon sets is essentially known. Erdos and Turan |[ET41|| exploited the fact 



that the pairwise differences si — S2 from a Sidon set are distinct to establish the improved 
upper bound R{2, n) < ^Jn. (By the notation f{n) ^ g{n), we mean that limsup^^^^ < 
1.) Ruzsa |[Ruz93|| has observed that the Erdos/Turan argument can be modified to give also 



-^(3, n) < y/n. A construction of Singer ||Sin38|] (see Proposition ^]T|(iii) below) yields the 
complementary lower bounds R{3, n) > R{2, n) > ^/n. 

Present knowledge of R{g, n) for > 3 is much less impressive. Constructions of large 
B*[g] sets have so far yielded only moderate lower bounds on R{g,n). The trivial upper 
bound on R{g,n) has been improved for general g, but only quite recently. In this paper, 
we present the strongest upper bound to date on R{g, n) for all (7 > 21 as well as for all odd 
g >5, and we also improve or match the best known lower bounds on R{g,n). 

The seminal paper of Cilleruelo, Ruzsa, and Trujillo | |UK1] | gave the first upper bound 
for R{g,n) that was nontrivial for infinitely many g, namely R{2g,n) < 1.3181\/2^, and 
Green [Pre01|| improved this to R{2g,n) < 1.3038a/2^. Green also proved 



R{2g,n)<Jl{2g-l)n, (6) 
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which is stronger than our results for even integers g < 20. (In both these papers, the results 
were stated in terms of the function F2{g,n), a notational difference only as F2{g,n) = 
R{2g,n) for all g and n.) 



It seems likely that lim„ 
g = 2 and g = 3. We define 



R{g,n) 



gn 



exists for every g^ but this has been proved only for 



p{9) 



lim inf 

n— >oo 



R{g,n) 



p{g) := lim sup 



gn 
R{g,n) 



gn 



Pig) ■■- 



lim 

rx— >oo 



R{g,n) 



gn 



Thus p(2) = p(3) = and p{g) is not known to be well-defined for g > 4:. Green's 



result ||Gre01|| is equivalent to p{g) < yly 1 — K 

The following theorem gives the bounds on R{g,n) that we prove in this work. Note 
that for even g < 20, Green's bound (BT) is superior. 



Theorem 1.2. R{g, n) < 1.30036^p: for all g and R{g, n) < 1. 31925 ^/{g^^^T)n + \ if g 
odd. Further, we have the following upper hounds forji^g)"^: 



IS 



P{9? < 



1.74043 -M0483, 

0.026335 _|_ /g 011572 — 0-083397 _|_ 0.00069356 



1.58337- 



1.74043-^^, 
1^58337 _MZi949 



+ J0.011572 



0.22784 , 0.0051768 
7, 1 771 ) 



g < 8 and even; 
g > 10 and even; 
g < 23 and odd; 
g >25 and odd. 



We comment that our result is an improvement in two aspects: we improve the numerical 
constants in the bound on p(g) and we replace "<" with "<" in the bounds on R{g, n). These 
minor improvements aside, we believe that our geometrical approach is easier to follow and 
promises future improvements. Accordingly, throughout this paper we discuss the quality 
of the results, what could possibly be improved and what is best possible. In addition, to 
the best of our knowledge. Theorem |1.2| gives the first general upper bound for R{2g — 1, n) 
that improves upon the trivial inequality R{2g — 1, ra) < R{2g, n). Theorem |1.2| is proved in 
Corollary ^]2| and Proposition |5.4| . 

We turn now to lower bounds for R{g,n). Constructions of "i?*[2] (mod n)" sets have 
been given by Singer [pin38|] , Bose [Pos42|| , and Ruzsa [[Ruz93|| . These constructions were 
extended to B*[g] sets in ||CRT|| . In Propositions ^]l| and below, we generalize the first 
three constructions and further optimize the extension given in [PRT|| . 

Theorem [O] presents our improved lower bounds for R{g,n), stated in terms of the 
function p{g). 
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Figure 2: Lower bounds on p{g) and upper bounds on p{g) 

Theorem 1.3. We have 

p(4)>_2^ > 0.755, p(i4) > > 0.759, 

P(6) > ^ > 0.730, p(16) > ^ > 0.775, 

p(8) > ^ > 0.755, p(18) > ^ > 0.769, 

p(10) > 5^ > 0.737, P(20)>^ > 0.778, 

p(12)>^ > 0.774, p(22) > ^ > 0.767, 

and for any g > 12, 

« + 2U/3J + U/6J 

- - - 2g [9/3J + 2g 

/n particular, for any 6 > we have R{g, n) > — 5)^fgn if both g and ^ are sufficiently 
large in terms of 5. 

We note that > 0.7938. The lower bound for p(4) reproduces a result of Habsieger 
and Plagne ||HP|| , while the lower bounds for p(6) and p(10) reproduce results of ||CRT|] ; 
for other even g, our lower bounds are new. These lower bounds on p, together with the 



strongest known upper bounds on p including those derived from Theorem p..2| , are plotted 
for 2 < 51 < 42 in Figure ||. 

To the authors' knowledge, nothing more is known about lower bounds for 
R{2g + l,n) for general n and g than the obvious inequality R{2g,n) < R{2g + l,n). In 
particular, 

^^i?(2(?,n) ^ R{2g + l,n) 



2g+l " x/WTT] 



n 
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which imphes that 



+ 2^ 



It seems hkely that this inequahty is actually an equality (i.e., when g is fixed, the quotient 
R{2g + l,n)/ R{2g,n) should tend to 1 as tends to infinity), but this is known only for 
g = l [lRuz93|| . 



Theorem 1.3 contains our best results for fixed g, but we can obtain a better constant 



in the lower bound if we allow g to grow slowly with n: 

Theorem 1.4. For any 5 > 0, we have R{g,n) > — 5^^/gn if both and ^ are 
sufficiently large in terms of 6. 



We note that 2/^/7? > 1.128. Theorems |1.3| and |1.4| are both important for obtaining 



the upper bounds on A{e) given in Theorem |1 . 1| . Theorem ^]3| is proved in Section |6.6| 



through explicit constructions, while Theorem is proved in Section |6.5| by a probabilistic 
argument. 

As mentioned above, in Section 2 we describe several other problems in continuous 
Ramsey theory, along with their discrete analogues. In Section 3 we list some easy-to-derive 
properties of the function A(e), while in Section 4 we describe how we establish our strongest 
lower bounds for A{e). Section 4 is the most involved part of this paper, employing many 
techniques from harmonic analysis, most notably Fourier series. The connection between 
the continuous and discrete problems measured by A(e) and R{g,n), respectively, is given 
in Section 5, and our upper bounds for R{g, n) are derived therein. In that section, we also 
establish in Theorem |5]^ that large B* [g] sets must have many pairwise sums that repeat 



at least ag times for suitable a, and in Theorem |5.9| we show a stronger upper bound for 
B*[g\ sets that are uniformly distributed in {1, ...,«} (which we conjecture is typical for 
the largest possible B*[g\ sets). Section 6 is devoted to lower bounds for R{g, n), which rely 
upon improved constructions of B* [g] sets and their "mod n" counterparts. In Section 7 we 
apply the results of Section 6 to bound A (5) from above. Finally, in Section 8 we collect 
together several open questions and problems relating to our methods. 
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2 Some Problems in Continuous Ramsey Theory 



A "coloring Ramsey theorem" has the form: 

Given a sufficiently large set of mathematical objects colored with a finite number 
of colors, there is a highly structured monochromatic subset. 

The prototypical example is Ramsey's Theorem itself: However one colors the edges of 
the complete graph with r colors, there is a monochromatic complete subgraph on t 
vertices, provided that n is sufficiently large in terms of r and t. Another example is van der 
Waerden's Theorem: However one colors the integers {1,2, . . . ,n} with r colors, there is a 
monochromatic arithmetic progression with t terms, provided that n is sufficiently large in 
terms of r and t. 

In many cases, the coloring aspect of a Ramsey-type theorem is a ruse and one may 
prove a stronger statement with the form: 

Given a sufficiently large set R of mathematical objects, any large subset of R 
contains a highly structured subset. 

Such a result is called a "density Ramsey theorem." For example, van der Waerden's 
Theorem is a special case of the density Ramsey theorem of Szemeredi: Every subset of 
{1, 2, . . . , n} with cardinality at least n5 contains a t-term arithmetic progression, provided 
that n is sufficiently large in terms of 6 and t. 

Ramsey theory is the study of such theorems on different types of structures. By "contin- 
uous Ramsey theory" we refer to Ramsey-type problems on continuous measure spaces. In 
particular, this thesis is concerned with a density- Ramsey problem on the structure [0, 1) C M 
with Lebesgue measure. The type of substructure we focus on are symmetric subsets. 

There is frequently a connection between problems concerning [0, 1) with Lebesgue mea- 
sure and problems concerning {1, 2, . . . , n} with a uniform probability measure. We attempt 
to make this connection more explicit in our descriptions of problems in the remainder of 
this section. 



2.1 The Correlation Problem 

Erdos asked for upper and lower bounds on M„(|), where 

|(S + c) nT| 



M„(«) := 



inf 

\S\ = \_an\ 
T={l,2,...,n}\5 



sup 



n 



In essence, Erdos asked for a formalization of the observation: a large subset of {1, 2, 
can always be translated so as to have a large intersection with its complement. 
S. Swierczkowski Pwi58|| showed that lim„^oo j^n(tt) = M(a), where 



,n} 



M{a):= inf sup A((5 + c) H T). 

X{S)=a ceM 

r=[o,i)\5 
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Working in the continuous setting, Swierczkowski showed that 

2- 10a(l 



M{a) > 



a) 



which improved on the asymptotic bounds then known for M„(a). 

Guy94| , problem CI 7]) that 0.178 < M(|) < 0.2, but the precise value 



It is known (see 
remains unknown. 



2.2 The Convolution Problem 

Banakh, Verbitsky, and Vorobets ||BV V00|| considered the coloring version of the problem 



considered in the present paper: given a measurable finite coloring of [0,1), how large a 
monochromatic symmetric set is guaranteed? They claim that if 2 colors are used, then 
there is necessarily a monochromatic symmetric subset with measure ~ 0.155, and if 

r > 2 colors are used then there is a monochromatic subset with measure Unfortunately, 
the proofs of several crucial lemmas of this interesting paper appear only in earlier Russian- 
language articles, and we have been unable to verify their proofs. If r colors are used, then 
there is a monochromatic set with measure at least ^. In Section |46| , we show that any subset 
of [0, 1) with measure at least - contains a symmetric subset with measure °:^2i389^ 
Section we show further that any subset with measure at least | has a symmetric subset 
with measure 0.1496. This paper is concerned with the density version of their coloring 
problem. 

We call this the convolution problem because, if / is the indicator function of E (1 W, 
then f * f{c) := f^f{x)f{c — x) dx is the measure of the maximal symmetric subset of E 
with center c/2. Therefore the Banakh- Verbitsky- Vorobets problem is equivalent to finding 
the infimum of the possible values of 

max{||/*/|U,||(l-/)*(l-/)|U} 

as / ranges over all indicator functions of measurable subsets of [0,1). We exploit this 
connection to Fourier analysis to give lower bounds for the function A{e) defined in Eq. (|^). 

We note that the discrete versions of these problems are closely related to the continuous 
versions. Specifically, define MS{n,r) to be the maximum of integers M such that for any 
coloring of {1,2, ...,n} with r colors, there is a monochromatic symmetric subset with 
cardinality M. Also, define MS'([0, 1], r) to the supremum of real fi such that for any 
measurable coloring of [0, 1] with r colors, there is a monochromatic subset with measure 
/i. It is shown in [|BVVOO|] that MS'([0, l],r) = limn^oo MS {n,r)/n. Proposition ^73| below 



gives the analogous connection between A{e) and the function R{g,n) defined in Eq. (^. 
2.3 The Linear Equation Problem 

Following Chung, Erdos, and Graham ||CEC| ], for any matrix A, we say that S* C {1, 2, . . . , ra} 
is y4-hitting if for every vector x = {xi, X2, ■ ■ ■ , Xg) with G {1, 2, . . . , n} and xA = 0, either 
there is some Xi E S or xi = X2 = ■ ■ ■ = Xs (this second condition is included to avoid certain 
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trivialities). Set 5n to be the minimum density of an A-hitting set, i.e., 

(5„ :=min|^: {l,2,...,n}, ^ is A-hitting| . 

If, for example, A = (1, —2, 1), then 6n is the minimum density of a subset of {1,2,..., n} 
that intersects every 3-term arithmetic progression in {1, 2, . . . , n}. 

We also say that S C [0,1] is A-hitting if for every vector x = {xi,X2, ■ ■ ■ ,Xs) with 
Xi e [0, 1] and xA = 0, either a;i = X2 = ■ ■ ■ = or there is some Xi G S. Set 6 to be the 
infimum of the measures of A-hitting sets, i.e., 

S := inf {X{S) : ^ C [0, 1], ^ is /l-hitting} . 

Given our experience with the above correlation and convolution problems, one might 
guess that liminf„5„ = 6. In ||CEG|| , the values of both 6 and liminf„(5„ are derived for 
several matrices A, and indeed equality seems to be the typical case. Surprisingly, however, 
there are non-trivial examples where this is not the case. For 



A 



for example, it is shown in ||CEG|] that 5=4 whereas 0.199 < liminf„5„ < 0.1997. 
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3 Easy Bounds for A{e) 



We now turn our attention to the investigation of the function A{e) defined in Eq. (j^). In 
this section we estabhsh several simple lemmas describing basic properties of A. 

Lemma 3.1. A{e) > 26 - 1 for all < e < 1. 

Proof. For A C [0, 1), the centrally symmetric set An {1 — A) has measure equal to 

X{A) + A(l -A)- X{A U (1 - A)) = 2X{A) - X{A U (1 - A)) > 2X{A) - 1. 

Therefore D{A) > 2X{A) — 1 from the definition (|^) of the function D. Taking the infimum 
over all subsets A of [0, 1) with measure e, this becomes A(e) > 2e — 1 as claimed. ■ 

While this bound may seem obvious, it is in many situations the state of the art. As we 



show in Proposition ITTT] below, A{e) actually equals 25 — 1 for < e < 1; and A{e) > 2e — 1 



is the best lower bound of which we are aware in the range 0.61522 < e < ii = 0.6875. 



One is tempted to try to sharpen the bound A(e) > 2e — l by considering the symmetric 
subsets with center 1/3, 1/2, or 2/3, for example, instead of merely 1/2. Unfortunately, it 
can be shown that given any £ > and any finite set {ci, C2, . . . , Cn}, one can construct a 
sequence Sk of sets, each with measure e, that satisfies 



lim max {A {Sk n (2ci - Sk))} = 2e - 1. 

fc— >oo \ l<i<n J 

Thus, no improvement is possible with this sort of argument. 
Lemma 3.2 (Trivial Lower Bound). A{e) > for allO < e <1. 



Proof. We repeat the argument given briefiy in Section LI. Given a subset A of [0,1) of 
measure e, let A{x) denote the indicator function of A, so that the integral of A{x) over any 
interval containing [0, 1) equals e. If we define /(c) := A{x)A{2c — x) dx, then /(c) is 
the measure of the largest symmetric subset of A with center c, and we seek to maximize 
/(c). But / is clearly supported on [0, 1), and so 



D(A) = max/(c)> / fic)dc= / / 

^ Jo J~oc Jo 



A{x)A{2c — x) dcdx 



A{x) ( ^ / A{w) dw]dx = 



2 

Since A was an arbitrary subset of [0, 1) of measure e, we have shown that A(£:) > ^^e^. ■ 
Lemma 3.3. A(£:) < for all < e < x < 1. In particular, A(£:) < e. 

Proof. If tA := {ta: a G A} is a scaled copy of a set A, then clearly D{tA) = tD{A). 
Applying this with any set A C [0, 1) of measure x and with t = | < 1, we see that 
^A is a subset of [0, 1) with measure e, and so by the definition of A we have A{e) < 
D{^A) = ^D{A). Taking the infimum over all sets A C [0, 1) of measure x, we conclude 
that A(£) < ^A{x). The second assertion of the lemma is obvious, and indeed it follows 
from the first assertion in light of the trivial value A(l) = 1. ■ 
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It is obvious from the definition of A that A(e) is an increasing function; 
Lemma ^.31 shows that is also an increasing function. Later in this paper (see Proposi- 
tion |7.2|) , we will show that in fact even is an increasing function. 

Let 

SoT := {S\T)U{T\S) 

denote the symmetric difference of 5* and T. (While this operation is more commonly denoted 
with a triangle rather than with a diamond, we would rather avoid any potential confusion 
with the function A featured prominently in this paper.) 

Lemma 3.4. If S and T are two sets of real numbers, then \D{S) — D{T)\ < 2X{S oT). 

Proof. Let E be any symmetric subset of S, and let c be the center of E, so that E = 2c — E. 
Define F = E r\T H (2c — T) , which is a symmetric subset of T with center 2c. We can write 
A(-F) using the inclusion-exclusion formula 

X{F) = X{E) + X{T) + X{2c-T) 

- X{E U T) - X{E U (2c - T)) - A(T U (2c - T)) + X{EUTU (2c - T)). 

Rearranging terms, and noting that T U (2c — T) (1 E UT U (2c — T), we see that 

X{E) - X{F) < -A(T) - A(2c - T) + X{E U T) + X{E U (2c - T)) 

Because reflecting a set in the point c does not change its measure, this is the same as 

X{E) - A(F) < -A(T) - A(r) + X{E U T) + X{E U (2c - T)) 
= -A(T) - A(T) + X{E U T) + A((2c -E)UT) 
= 2{X{EUT) - A(r)) 

< 2{X{S U T) - A(T)) = 2X{S \ T) < 2X{S o T), 

Therefore, since F is a symmetric subset of T, 

X{E) < X{F) + 2X{S o T) < D{T) + 2X{S o T). 

Taking the supremum over all symmetric subsets E of S, we conclude that D{S) < D{T) + 
2A(5' oT). If we now exchange the roles of 5* and T, we see that the proof is complete. ■ 

Lemma 3.5. The function A satisfies the Lipschitz condition |A(x) — A{y)\ < 2\x — y\ for 
all X and y in [0, 1]. In particular, A is continuous. 

Proof. Without loss of generality assume y < x. In light of the monotonicity A{y) < A(x), it 
suffices to show that A(?/) > A(x) — 2{x — y). Let S C [0, 1) have measure y. Choose any set 
-R ^ [0, 1) V'S' with measure x — y, and set T = SUR. Then SoT = R, and so by Lemma 
D{T) - D{S) < 2X{R) = 2{x - y). Therefore D{S) > D{T) - 2{x - y) > A(x) - 2{x - y) 
by the definition of A. Taking the infimum over all sets S C [0, 1) of measure y yields 
A(?/) > A(a;) — 2(x — y) as desired. ■ 



15 



4 Lower Bounds for A(e) 



Section ^]T| below makes explicit the connection between A (5) and harmonic analysis. Sec- 
tion gives a simple, but quite good, lower bound on A{e). In Section ^73| , we give a more 
general form of the argument of Section [4.2| . Using an analytic inequality established in 
Section we investigate in Section the connection between ||/ * / ||oo and the Fourier 
coefficients of /. In Section [4.6| , we combine the results of Sections |4.2| and to show 
^(e) > 0. 5913896:^. The bound given in Section [4.2| and improved in Section |4.6| depends on 
a kernel function with certain properties; in Section |4.7| we discuss how we chose our kernel. 



In Section |4]^, we use a different approach to derive a lower bound on A{e) which is superior 

5 



for I 



< e < 



4.1 Introduction and Notation 



There are many ways to define the basic objects of Fourier analysis; we follow |[Fol84|| . Unless 
specifically noted otherwise, all integrals are over the circle group T := M/Z; for example, 
denotes the class of functions / for which dx is finite. For each integer j, we define 

/(j) := / f{x)e~'^'^^^^dx, so that for any function / G L^, we have f{x) = /(i)^^'^*"'^ 
almost everywhere. We define the convolution f *g{c) = J f{x)g{c — x) dx, and we note that 

/ * gU) = f{j)g{j) for every integer j; in particular, / * f{j) = f{] f. 
We define the usual norms 



{j\f{x)\^dx) 



i/p 



and 



lim 

p— >oo 



sup{?/: A({x: > y}) > O}. 



With these definitions. Holder's Inequality is valid: if p and q are conjugate exponents — that 



IS. 



P Q 



\\f*f Wi- 
tt = {aj}j, 



1— then \\fg\\i < ||/||p||fi'||g. We also note that \\f*g\ 

- 11/11? = hoy 



then 



i||(7||i; in particular. 

We shall also employ the norms for bi-infinite sequences: if 

i/p 



(E 



we use the same notation for the and 



sup 



Although 



P'\^"' and ||a||oo = linip^oo 

fp norms, no confusion should arise, as the object 



inside the norm symbol will either be a function on T or its sequence of Fourier coefficients, 
respectively. With this notation, we recall Parseval's identity 



f{x)g{x) dx = ^f{j)g{- 



-3] 



■- g is real- valued (so that 
2- The Hausdorff- Young 
whenever p and q are conjugate exponents with 1 < p <2 < q < 00, 
can be thought of as a generalization of this latter version of Parseval's identity. We also 
require the definition 



(assuming the integral and sum both converge); in particular, if / 
/(— j) is the conjugate of /(j) for all j), this becomes 
inequality, < 



i/p 



Jil>"i 



(7) 
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for any sequence a — {aj}j^z, so that o\\a\\p — \\a\\p, for example. 

Wc recall that the Fourier coefficients of any function f E satisfy the estimate f{j) — 
O Q^; in particular, is finite for all p > 1. Moreover, if / e is continuous, then 

fU) — O (j^^- We also note that for any fixed sequence a — {aj}j^z, the £*'-norm ||a||p is 
a decreasing function of p. To see this, suppose that 1 < p < g < oo and a e Then 
Wjl ^ Ikllp for all J G whence < (since q—p > 0) and so \aj\'^ < ||a||^~P|aj|^. 

Summing both sides over all j G Z yields ||a||^ < — and taking gth roots 

gives the desired inequality ||a||g < ||a||p. 

Finally, we define a "pdf", short for "probability density function", to be a nonnegative 
function in whose L^-norm (which is necessarily finite, since T is a finite measure space) 
equals 1. Also, we single out a special type of pdf called an "nif", short for "normalized 
indicator function", which is a pdf that only takes one nonzero value, that value necessarily 
being the reciprocal of the measure of the support of the function. (We exclude the possibility 
that an nif takes the value almost everywhere.) Specifically, we define for each C T the 
nif 

ix{E)-^ xeE, 
[0 x^E. 



fE{x) 



Note that if / is a pdf, then 1 = /(O) = /(O)^ = = ||/ * f\\,. 

We are now ready to reformulate the function A.{e) in terms of this notation. 

Lemma 4.1. We have 

l^^inffl llf *flloo < |£^inf/||/*/||oo = A(£), 

the first infimum being taken over all pdfs g that are supported on [— |, |], and the second 
infimum being taken over all nifs f whose support is a subset o/[— |, |] of measure |. 

Proof. The inequality is trivial, since every nif is a pdf; it remains to prove the equality. 

For each measurable A C [0, 1), define Ea ■= {|(a — I) - a e A] C [— i, i]. The sets A 
and Ea differ only by translation and scahng, so that X{A) — 2X{Ea) and D{A) — 2D{Ea). 
Thus 

A{6) := mf{D{A) : A C [0, 1), X{A) = e} 



inf \ ^ ^ ^ [0, 1), K^) = e 



{2X{EaW 

For each E C [— |, i] with X{E) = |, the function /^(a;) is an nif supported on a subset of 
[— |, |] with measure |, and it is clear that every such nif arises from some set E. Thus, it 
remains only to show that = \\fE * /e||oo, i-e., that D{E) = X{EY\\fE * /b||oo- 
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Fix E C [— |, and let E{x) be the indicator function of E. Note that fE{x) = 
X{E)~^E{x). The maximal symmetric subset of E with center c is E (1 (2c — E), and this 
has measure J E{x)E{2c — x) dx. Thus 

D{E) := sup{A(C) : C CE,C is symmetric} 

= sup I / E{x)E{2c — x) dx 



sup (^j \{E)fE{x)\{E)fE{2c- x) dx 
\{Efsn^ (^j fE{x)fE{2c-x) dx 



= X{EfsnpfE*fE{2c) 

c 

= m'\\fE*fE\L, 

as desired. ■ 

The convolution in Lemma |4.1| may be taken over M or over T, the two settings being 
equivalent since / * / is supported on an interval of length 1. In fact, the reason we scale 
/ to be supported on an interval of length 1/2 is so that we may replace convolution over 
M, which is the natural place to study A{e), with convolution over T, which is the natural 
place to do harmonic analysis. 

4.2 The Basic Argument 

We begin the process of improving upon the trivial lower bound for A{e) by stating a simple 
version of our method that illustrates the ideas and techniques involved. 

Proposition 4.2. Let K he any continuous function on T satisfying K{x) > 1 when x G 
i]' ^'^^ f be a pdf supported on [—\,\]- Then 

\\f*f\\oo>\\f*f\\i>\\K\\-,;,. 

Proof. We have 

f{x)dx< j f{x)K{x)dx = J2fU)KH) 

by Parseval's identity. Holder's Inequality now gives 1 < ||/||4||i^||4/3, which we restate as 
the inequality ||i^||~^3 < II/II4. 

Now 11/11^ = J2j = 1/ * = 11/ * fWl by another application of Parseval's 

identity. Since {f*ff < \\f*f\Uf*f), integration yields < i|/*/||oo||/*/||i = ||/* 

/lloo- Combining the last three sentences, we see that H-R'H^^g < II/II4 = ||/*/||2 < ||/*/||oo 
as claimed. ■ 

This reasonably simple theorem already allows us to give a nontrivial lower bound 
for A{e). 
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Figure 3: The function K-i{x) 



The step function 

KAx) :-- 




0<\x\<\ 

4^1-^1—2 



has 1 1^1 1 1 



7r4+24C{|)^(5+24/3-28/3) 

> 1.074 (the elaborate constant used in the definition of Ki 



8(24/3_i)\(|)^ ^ 

was chosen to minimize Hi^im/a); a careful reader may complain that is not continuous. 
The continuity condition is not essential, however, as we may approximate Kx by a continuous 
function L with ||-^v||4/3 arbitrarily close to ||-ft'i||4/3. Green [|Gre01|] used a discretization of 
the kernel function 



K.{x\ 



a + a (40 (2x 

8 



< < i 
) \<\A<\ 



with a suitably chosen a to get ||i^'2|l4/3 ^ f ^ 1.142. We get a slightly larger value of HfTll^/g 
in Corollary [4.3| with a much more complicated kernel. See Section [4.7| for a discussion of 
how we came to find our kernel. 

Corollary 4.3. If f is a pdf supported on [—\, |], then 

11/ */||2> 1.14915. 
Consequently, ^{e) > 0.574575£2 for allO <e <1. 
Proof. Set 



K.(x) 



0.6644 + 0.3356 ( ^tan-^ 



l-2x 



1.2015 



0<|a;|<i, 
i<l^l<i 



K^{x) is pictured in Figure ^ 
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We do not know how to rigorously bound ||it^3||4/3, but we can rigorously bound ||-ft'4||4/3 
where i^^4 is a piecewise linear function 'close' to K^,. Specifically, let K4^{x) the even piecewise 
linear function with corners at 



We calculate (using Proposition |4. 14| below) that ||-ft'4||4/3 < 0.9658413. Therefore, by Propo- 
sition t4.2| we have 



11/ * fWl > (0.9658413)-^ > 1.14915. 
Using Lemma |4.1| , we now have A(e) > k2(l. 14915) > 0.574575e2. 



The constants in the definition (^) of K^[x) were numerically optimized to minimize 
||-ft'4||4/3 and otherwise have no special significance. The definition of K^{x) is certainly not 
obvious; there are much simpler kernels that do give nontrivial bounds. In Section |4.7| below, 
we indicate the experiments that led to our choice. 

We note that the function 

h{x) := I 71^' -^'^ < ^ < 
I 0, otherwise 

has j b = 1 and ||6 * 6II2 < 1.14939. Although b is not a pdf (it is not in L^), it gives strong 
testimony that the bound on ||/ * /II2 given in Corollary [4.3| is not far from best possible. 

Note that this corollary, along with Corollary |5.4| , gives R{g,n) < 1.31925^/^, only 
slightly worse than the bound given in | |CK1'| | . 



This bound on ||/ * /II2 may be nearly correct, but the resulting bound on ||/ * / ||oo is 
not: we prove below that ||/ * /||oo > 1.182778, and believe that ||/ * /||oo > 7r/2. We have 
tried to improve the argument given in Proposition [1.2| in the following four ways: 



1. Instead of considering the sum f{i)K{—j) as a whole, we single out the central 
terms (which dominate the sum) and the tails (which contribute essentially nothing), 
and deal with the three resulting sums separately. This generalized form of the above 
argument is expounded in the next section. The success of this generalization relies 
on certain inequalities restricting the possible values of these central coefficients; es- 



tablishing these restrictions is the goal of Sections [4.4| and [4.5| . The final lower bound 
derived from these methods is given in Section W^. 



We can try to find more advantageous kernel functions K{x) for which we can compute 
||i^||4/3 in an accurate way. A detailed discussion of our search for the best kernel 



functions is in Section 4.7. 



3. The application of Parseval's identity can be replaced with the Hausdorff- Young in- 
equality, which leads to the conclusion ||/*/||oo > ll-^llp'^! where P < | and g > 4 
are conjugate exponents. Numerically, the values (p, g) = (|,4) appear to be optimal. 



However, Beckner's sharpening ||Bec75|| of the Hausdorff- Young inequality leads to the 



stronger conclusion ||/ * /||oo > C{q)\\K\\-'i where C{q) = |(1 - |)'?/2-i = ^ + 0(1). 
We have not experimented to see whether a larger lower bound can be obtained from 
this stronger inequality by taking g > 4. 
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4. Notice that we used the inequahty \\g\\2 < ||5'||oo||5'||i with the function g = f * f ■ This 
inequahty is sharp exactly when the function g takes only one nonzero value (i.e., when 
g is a. nif), but the convolution / */ never seems to behave that way. Perhaps for these 
autoconvolutions, an analogous inequality with a stronger constant than 1 could be 
estabhshed. Unfortunately, we have not been able to realize any success with this idea. 



although we believe Conjecture below. If true. Conjecture |4.4| implies the bound 
A(e) > 0.651e2. 

Conjecture 4.4. If f is a pdf supported on [—\, |], then 

ll/*/lloo ^ vr/2 



\\f*f\\l - log4' 



with equality only if either f{x) or /(— x) equals \J^^^ on the interval |x| < |. 



We remark that Proposition 4.2 can be extended from a twofold convolution in one 



dimension to an /i-fold convolution in d dimensions. 

Proposition 4.5. Let K he any continuous function on satisfying K{x) > 1 when x G 
[—2^, and let f be a pdf supported on [— Then 

iinioo > iini^ > 

Every subset o/[0, l]'^ with measure e contains a symmetric subset with measure (0.574575)'^£:^. 

Proof. The proof proceeds as above, with the conjugate exponents (gfrr? 2/;,) in place of 
(|,4), and the kernel function K{xi,X2, ■ ■ ■ ,Xd) = K{xi)K{x2) ■ ■ ■ K{xd) in place of the 



kernel function K(x) defined in the proof of Corollary [4.3| . The second assertion of the 
proposition follows on taking h = 2. m 



4.3 The Main Bound 

We now present a more subtle version of Proposition |4.2| . Recall that the notation n||a||p 
was defined in Eq. (|^). We also use to denote the real part of the complex number z. 

Proposition 4.6. Let 1 < m < m' < oo. Suppose that f is a pdf supported on [—\, |] 
and that K is even, continuous, satisfies K{x) = 1 for — ^ < x < and m'||-^||p > 0. Set 
M:=l-ir(0)-2E7="i'^(j)3^/(j)- Then 

\\f*f\\i> E + +2j2mm'~o{i) (9) 

\j\<m' \rn\\J^U/3/ j=i 

as m' —>■ OO. 

Proof. The first inequality follows from Parseval's formula 

j j \j\<m' 
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As in the proof of Proposition [4 .21, we have 



1 = J f{x)K{x)dx = j2f{j)K{-j) = J2 hmi-j) + Yl f(j)K{ 



\j\<m \j\>m 



Since K is even, K{—j) = K{j) is real, and since / is real valued, f{—j) = f{j)- We have 

m— 1 

1 = k{o) + 2 J2 Kmhj) + Yl fU)K{j), 

j=l \j\>m 

which we can also write as M = J2\j\>m fU)^U)- V •= J2\j\>m' 1-^0)1' note that 
rj = o(l) as m' — > oo since K is continuous. Taking absolute values and applying Holder's 
inequality, we have 

\M\ < J2 \fm{j)\ 

\m'>\j\>m J V'TJ'^lil^'" 

< f E ™ll^ll4/3 + 0(l), 

\TO'>|j|>m / 



which we recast in the form 

. f\M\-n{\\\^ f M \^ 

-o 1 



mi>\j\>m ^ mll-f^ 114/3 / VmH-^IU/S 

We add Yli\j\<ra both sides and observe that /(O) = 1 and |/(j)| > |3ft/(j)l to finish 

the proof of the second inequality. ■ 

Corollary 4.7. If f is a pdf supported on [— |, |], then 

E |/(j)r> 1-14915 -0(1) 

|j|<m' 

as m' oo 

Proof. The kernel function used in the proof of Corollary ^]3| has i^4(0) = 0.870250799 and 
1 117^4114/3 = 0.208784534. Now apply Proposition ^ with m = 1. ■ 

Corollary 4.8. Let f be a pdf supported on [—1, j], and set xi := 3?/(l). Then as m' ^ oo 



\\f*f\\i> E i/(j)r>i+2xt+( 

\j\<m' ^ 



4' 4J 

0.368067372 - 0.541553784x1^ ^ 
0.239175395 



ofl). 
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Proof. Set 



1 \^\<i 



1 _ (1 _ (4(1 _ a.))1.61707)0.546335 1 < < 1. 

Denote by Kq{x) the even piecewise linear function with corners at 

(0, + ^—r, + ^—-)] (where t = 0, 1, . . . , 10^). 

We find (using Proposition that KeiO) = 0.631932628, 7^6(1) = 0.270776892, and 

2||-^6||4/3 = 0.239175395. Apply Proposition [4.6| with m = 2 to finish the proof. ■ 



With m' = oo and Xj = ^f{j), the bound of Proposition |4.6| becomes 



^m— 1 j>i -x^ \ 4 m— 1 



m||^||4/3 / j=i 

This is a quartic polynomial in the Xj, and consequently it is not difficult to minimize, giving 
an absolute lower bound on ||/ * /II2. This minimum occurs at 



X 



^ ll/J-liVs 
ill-^ II4/3 



where {K{j)Y/^ is the real cube root of K{j). Consequently, 



which is nothing more than the bound that Proposition O gives with m = 1. Moreover, 

4 

I4/3' 



1 = sup ||(a + (l-«)i^n 

l||^||4/3 / 0<Q<1 



(the details of this calculation are given in Section [4.7.1|) so that Proposition ^]6| by itself 
does not give a different bound on ||/ * /II2 than Proposition [4.2| . However, we shall obtain 
additional information on f{j) in terms of ||/ * / ||oo in subsection 4^ below, and this infor- 
mation can be combined with Proposition ^]6|to provide a stronger lower bound on ||/*/||oo 
than that given by Proposition |4.2|. 



4.4 Some Useful Inequalities 

Hardy, Littlewood, and Polya |[HLP88|| call a function u{x) symmetric decreasing if u{x) 
u{—x) and u{x) > u{y) for all < x < y, and they call 

f^\x):=M{y: X {{t : f {t) > y}) < 2\x\} 
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the symmetric decreasing rearrangement of /. For example, if / is the indicator function 
of a set with measure n, then f^'^'^ is simply the indicator function of the interval (— f , f )• 
Another example is any function / defined on an interval [—a, a] and is periodic with period 
— , where n is a positive integer, and that is symmetric decreasing on the sub interval [—-,-]; 
then f^'^^{x) = /(^) for all x G [—a, a]. In particular, on the interval [— we have 
PQgsdr^27rjx) = cos(27rx) for any nonzero integer j. We shall need the following result. 

Lemma 4.9. 

J f{x)u{x) dx< j f"^\x)u"^\x) dx. 



Proof. This is Theorem 378 of [ HLP8 



We say that / is more focused than f (and / is less focused than /) if for all z G [0, |] 
and all r G T we have 

fT+Z pZ 

; J — z 

For example, f^'^'^ is more focused than /. In fact, we introduce this terminology because it 
refines the notion of symmetric decreasing rearrangement in a way that is useful for us. To 
give another example, if / is a nonnegative function, set / to be ||/||oo times the indicator 
function of the interval \— ^,,},, , wTn— 1; then / is more focused than /. 

Lemma 4.10. Let u{x) be a symmetric decreasing function, and let h, h be pdfs with h more 
focused than h. Then for all r G T, 

M.-rW.)<i.</ft(xWx)dx. 

Proof. Without loss of generality we may assume that r = 0, since if h{x) is more focused 
than h{x), then it is also more focused than h{x — r). Also, without loss of generality we may 
assume that h, h are continuous and strictly positive on T, since any nonnegative function 
in can be L^-approximated by such. 

Define H{z) = J'^h{t)dt and H{z) = P_^h{t)dt, so that H{\) = H{\) = 1, and 
note that the more-focused hypothesis implies that H{z) < H{z) for all z G [0, Now h is 
continuous and strictly positive, which implies that H is differentiable and strictly increasing 
on [0, |] since H'{z) = h{z) + h{—z). Therefore exists as a function from [0, 1] to [0, |]. 
Similar comments hold for H^^. 

Since H < H, we see that H~^{s) < H^^{s) for all s G [0, 1]. Then, since H~^{s) and 
H~^[s) are positive and u is decreasing for positive arguments, we conclude that u{H~^{s)) < 
u{H~^{s)), and so 

[ u{H-\s))ds< [ u{H-\s))ds. (10) 
Jo Jo 

On the other hand, making the change of variables s = H{t), we see that 

nl fH'Hl) 1-1/2 /. 

/ u{H-\s))ds= / u{t)H'{t)dt= / u{t){h{t) + h{-t)) dt = u{t)h{t)dt 



Jo Jo Jt 
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since u is symmetric. Similarly u{H ^(s))ds = Jju{t)h{t) dt, and so inequality (|ToD 
becomes J u{t)h{t) dt < J u{t)h{t) dt as desired. ■ 

4.5 Fourier Coefficients of Density Functions 

To use Proposition to bound A{e), we need to develop a better understanding of the 
central Fourier coefficients f{j) for small j. In particular, we wish to apply Proposition [4.6| 
with m = 2, i.e., we need to develop the connections between ||/ * /||oo and the real part of 
the Fourier coefficient /(I). 

We turn now to bounding |/(j)| in terms of ||/ * /||oo- The guiding principle is that if 
/ * / is very concentrated then || / * / ||oo will be large, and if / * / is not very concentrated 
then will be small. Green [ |Gre01| , Lemma 26] proves the following lemma in a discrete 

setting, but since we need a continuous version we include a complete proof. 

Lemma 4.11. Let f be a pdf supported on [—\, |]- For j ^ 0, 

I ?^-M2 ^ ll/*/l|oo . ( TT 

|/U)| < sm 



Vll/*/llc 

Proof. Let /i : T ^ R be defined by fi{x) := f{x — xq), with xq chosen so that is 
real and positive (clearly = |/(j)| and ||/ * /Joo = ||/i * /ilU)- Set h{x) to be the 

symmetric decreasing rearrangement of /i * /i, and h{x) := ||/ * /||oo-^(a;), where I{x) is the 
indicator function of [— ttttt^tm— , nin-^fu ]■ We have 

l/(j)r = hU f = /Wi(j) = [ fi* fi{x) cos(27rja;) dx < [ h{x) cos(27rx) dx 



by Lemma [4.9| . We now apply Lemma \i.lO\ to find 



|/(j)r < / h{x) cos{27rx) dx 



||/*/|Ucos(27ra;)rfx = ^^^^^sin^ 

l/(2|l/*/l|oo) ^ 



II/* /lie 



4.6 The Full Bound 



With the technical result of Section in hand, we can finally establish the lower bound on 
A{s) given in Theorem |Ll|(ii). 



Proposition 4.12. A{e) > 0.591389e2 for allO <e <1. 
Proof. Let / be a pdf supported on [—7,7], and assume that 



4' 4J 

11/ */|U< 1.182778. (11) 
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Set Xi = 3f?/(l) and X2 = 3^/(2). Since / is supported on [—\, |], we see that Xi > 0. By 
Lemma [4.11| , 



< xi < 0.4191447. (12) 

Corollary ^]8| with m' = 00, gives 

„, ,„ „, ,„ 4 /0.368067372- 0.541553784a;i\^ , , 

ll/*/IU>ll/*/lb>l + 2.t+( ^^^^^ . (13) 

Routine calculus shows that there are no simultaneous solutions to Inequalities (p^Tj), (O), and 



(y). Therefore ||/*/||oo > 1.182778, whence Lemma |0| implies that A{e) > 0.591389£2. ■ 
This gist of the proof of Proposition [4.12| is that if ||/ * /||oo is small, then 3?/(l) is 



small by Lemma ^.11| , and so ||/ * /H^ is not very small by Corollary [4.8| , whence ||/ * / ||oo 
is not small. If || / * / ||oo < 1.182778, then we get a contradiction. We can actually prove a 
meaningful result about ||/ * / II2 under the condition that ||/ * / ||oo is not much larger than 
1.182778. The following result will be useful in Section ||. 

Lemma 4.13. Let f be a pdf supported on [-\, \]. // 1.182778 < ||/*/||oo < 1.229837, then 
11/ * fWl > Yl ^ 21.922911 - 33.71194111/ * /|U + 13.676987||/ * f\\l - o(l) 

\j\<m' 

as m' ^ 00. 



Proof. The first inequality has already been shown in Proposition [4.6| . Set 

4 



B{xi) -.= 1 + 2x1 + 
so that by Corollary 



0.368067372 - 0.541553784xi 
0.239175395 



E i/(j)r>5W(i))-o(i). 

\j\>m' 



By hypothesis ||/ * f\\^ < 1.229837, so that by Lemma gTl 



But B{xi) is a decreasing function for xi E [0,0.47], so that 



5(W))>5(^^iM^sin(^)^ 

> 21.922911 - 33.71194111/ * f\\^ + 13.676987||/ * f\\l 
after a straightforward computation. 
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4.7 The Kernel Problem 



Let K be the class of functions K E L'^ satisfying K{x) > 1 on 
suggests the problem of computing 



-- -1 

4' 4J 



Proposition 4.2 



i/p 



inf 

KeK 



K\ 



inf 



In Proposition the case P = | arose, but using the Hausdorff- Young inequality in place 
of Parseval's identity we are led to consider 1 < p < |. Also, we assumed in Proposition f4.2| 
that K was continuous, but this assumption can be removed by taking the pointwise limit 
of continuous functions. 

As similar problems occur in ||CR'1]| and in [preOl 



we feel it is worthwhile to detail the 



thoughts and experiments that led to the kernel functions chosen in Corollaries |4.3| and 4.8 



4.7.1 Ad hoc Observations 

Our first observation is that if G G K, then so is K{x) := ^(G{x) + G(— x)), and since 
= < 1^0)1 we know that \\K\\p < \\G\\p. Thus, we may restrict our attention 

to the even functions in K. 

We also observe that decays more rapidly if many derivatives of K are continuous. 

This suggests that we should restrict our attention to continuous K, perhaps even to infinitely 
differentiable K. However, computations suggest that the best functions K are continuous 
but not differentiable at x = ^ (see in particular Section |4.7.4| and Figure 

In the argument of Proposition we used the inequality j f < J fK, which is an 
equality if we take K to be equal to 1 on [— |, |], instead of merely at least 1. In light of 
this, we should not be surprised if the optimal functions in IK are exactly 1 on [— ^, |]. This 
is supported by our computations. 

Finally, we note that if Ki G K, and > with Yli'^i ~ then '^■aiKi{x) G K also. 
This is particularly useful with Ki{x) := 1. Specifically, given any K2 G K with known 
||-f*^2||p (we stipulate ||-R'2||i = -^(0) =< 1 to avoid technicalities), we may easily compute the 
a G [0, 1] for which \\K\\p is minimized, where K{x) := aKi{x) + (1 — a)K2{x). We have 

\\K\\P = (« + (!_ a)k2{0)Y + (1 - a)\\\k\\l = (1 - (1 - a)Mf + (1 - aYN, (14) 

where we have set M := 1 — -^'2(0) and := iHKHj^. Taking the derivative with respect to 
a, we obtain 

p{l-aY-^(^M(^-^^—- My ^ -N 

the only root of which is a = 1 — ^^.[q^ljq/p (where ^ + ^ = 1). It is straightforward (albeit 
tedious) to check by substituting a into the second derivative of the expression (|I3|) that this 
value of a yields a local maximum for llA'Up. The maximum value attained is then calculated 

to equal N(^M'^ + N'^^^Y ^ ) which is easily computed from the known function K2. 
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Notice that when P = | (so g = 4), applying Proposition O with our optimal function 
K yields 



ll/*/ll^>ll^ll4/3=(^(^' + ^') 



whereupon we recover the conclusion of Proposition [4.6| with m = 1. 
4.7.2 Trigonometric Polynomials 

We wish to identify families of functions that are at least 1 on [— |] and whose Fourier co- 
efficients have small norm. Natural candidates are functions which have many Fourier 
coefficients equal to 0. In this section we consider trigonometric polynomials K{x) = 

Yl^=-m-^U)^^^^^^ of degree m. 

Montgomery [[Mon94| , Chapter 1] defines the Selberg polynomials S';|^(a, /3, x) and shows 



that 5*+ (a, x) > X[a,i3]ix) for all x, provided that a < f3 < a + 1; moreover, these functions 
are (in some senses) optimal majorants for X[a,i3]{x) among all trigonometric polynomials 
of bounded degree. These provide a natural family for investigation. 
We are concerned with [a,P] = We have for instance 

5 4 2 2 

'^2^(-i, h^) = Q + + ^) cos(27rx) - - cos(47rx), 

which satisfies 5'2^(— I, \, x) — > 1 when x G [— |, |], and 

inf ||(a + (1 - a)iSU-i h^) - ^r\U/s > 0.990. 

However, L{x) := 2cos(27rx) — cos(47rx) G K, and 

inf ||(a + (1 - a)(L(x))^\U/s < 0.989. 

0<a<l / \ \ II / 

Thus, even among trigonometric polynomials of degree 2, the Selberg polynomials are not 
optimal. In general, we have been unable to identify the degree-m trigonometric polynomial 
K{x) that is in K and for which Yl]jL-m minimized. 

4.7.3 Wavelets 

We can give an exact, finite expression for the p-norm of the Fourier coefficients of some 
large classes of functions. Sums of Haar wavelets give the simplest theoretical instance and 
the largest class of functions. 
Define 

fl <x < |, 
iP{x):=l-l i<x<l, 
I otherwise. 
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and ipm,n{x) '■= 2~'^^'^ip{2"^x — n). It is well-known that the Haar Wavelets {iprn,n- n G Z} 
form an orthonormal basis of the subspace of L^(]R) consisting of functions that have integral 
0. By the comments in Section |4.7.1| , 



-1/4 



inf Ilii'L/q = inf 
K&K ' KeK \ , WKW"^, 

JK=0 V 1 11^^114/3, 

SO that the J K = restriction is not a substantial restriction. 

It follows that every even function K E K with J K = and x G [— |, |) =^ K{x) = 1 
can be written in the form 

oo 

K{x) = V2{tpifi{x) +^i,o{-x)) + ^an (^2+Liog2nj,n(a;) + V'2+Liog2riJ,n(-a;)) . 

n=l 

Since this expression is linear, we can give the Fourier coefficients of K in terms of the easily 
computable Fourier coefficients of the ipm,n and in terms of the parameters Truncating 
the infinite sum at A^, we obtain a reasonably large family of possible functions K for which 
lli^llp can be computed quickly enough to numerically optimize ai, . . . , a^- Graphs of the 
optimal K{x) for p = | and various values of N are displayed in Figure |[ Note that if 
these wavelet-based functions are converging to some limit function in K, that limit function 
certainly does not seem to be differentiable at 

While the exact form of this expression for ^K^p is not difficult to compute, we suppress 
the details as they are very similar to the expressions computed in the next section. 



4.7.4 Piecewise-Linear Functions 

More useful computationally is the class of continuous piecewise-linear even functions whose 
vertices all have abscissae with a given denominator. Let Ci^^a) := Y1^=q{^ + '^)~^ denote 
the Hurwitz zeta function. If v is a vector, define Ap(v) to be the vector whose coordinates 
are the pth powers of the absolute values of the corresponding coordinates of v. 

Proposition 4.14. Let T he a positive integer, n a nonnegative integer, and p > 1 a real 
number. For each integer < t < T , define xt := \ + and let yt be an arbitrary real 
number, except that yo = 1. Let K{x) be the even function on T that is linear on [0, |] and 
each of the intervals [xt-i,Xt] (1 < t < T), satisfying K{0) = 1 and K{xt) = yt (0 < t < T). 
Then 

n\\K\\p = i2ApidA)-z)'/P, 

where d is the T- dimensional vector d = {yi — yo,y2 — yi, . . . ,yT — Vt-i), A is the T x 4T 
matrix whose {t, k)-th component is 

Atk = cos(27r(n + k — l)xt) — cos(27r(n + k — l)xt^i), 

and z is the AT -dimensional vector 

z = (8r7r2)-^'(C(2p, ^), C(2p, ^), . . . , C(2p, i±f^)) . 
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Figure 4: Optimal kernels generated by Haar wavelets 
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Proof. Note that 

K{-j) = K{j) 



1/2 



K{u) cos(27rju) du 



1/2 
1/4 



2 / cos(27rjn) (iw + 2 / (mju + 6j) cos(27rjn) du, 

Jo J xt-\ 



where rrit and ht are the slope and intercept of the hne going through {xt-i,yt-i) and 

2-2 

{xt,yt)- If we define C(j) := ^^^K{j), then integrating by parts we have 



vr^j^ / 1 



T \ 2nj 



sin [271 ju) 



1/4^ 



+ 2^ I — — — sin(27rjnj 



T 



t=i 



2vrj 



2 ,-2 



E 



mi 



T f^(2vrj)^ 



■ cos(27rjn) 



(15) 



The first term of this expression is 



2^ sin (vrl) + ^ ((mtXt + 6t) sin(27rjxt) - (rritXt-i + fej sin(27rjxt_i)) 

^ t=l t=0 ^ 

Since nit+iXt+bt+i = Ut = nT-tXt+h and Xq = xt = |, this entire expression is a telescoping 
sum whose value is zero. Eq. ( [T3| ) thus becomes 



2„-2 



E 



cos(27rjM) 



Xt-1 



X] (z/t - yt-i) (cos(27rjxt) - cos(27rjXi_i)) 



(16) 



t=i 



using = ^ = AT{yt — yt-i)- Each a;t is rational and can be written with denominator 
4T, so we see that the sequence of normalized Fourier coefficients C{j) is periodic with period 
AT. 

We proceed to compute n||-f^||p with n positive and p > 1. 



oo oo 



\3\>n 



2T 



E 



]=n 



\C{3)Y 

A2p 



j=n 



C{j) 



2T 
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Because of the periodicity of C{j), we may write this as 



(„iiii-ii,)' = 2 ('^y rf" ico)rf:(4Tr + 

^ ' \ j=n r=0 

= \WW) ^ l^(^-)rC(2p,^)], (17) 



which concludes the proof. 



Proposition |4.14| is useful in two ways. The first is that only d depends on the chosen 



values Ut- That is, the vector z and the matrix A may be precomputed (assuming T is 
reasonably small), enabling us to compute quickly enough as a function of d to 

numerically optimize the yt. The second use is through Eq. ([17|) . For a given we set 
yt = K{xt), whereupon C{j) is computed for each j using the formula in Eq. ([T6|) . Thus we 
can use Eq. (|l^ to compute with arbitrary accuracy, where Ki is almost equal to 

K. We have found that with T = 10000 one can generally compute quickly. 

In performing these numerical optimizations, we have found that "good" kernels K{x) G 
K have a very negative slope at x = | (e.g., see Figure §). Viewing graphs of these 
numerically optimized kernels suggests that functions of the form 



Irl < - 



which have slope —00 at x = may be very good. (Note that the graph of i^2, 1/2(2^) 
between ^ and | is the lower half of an ellipse.) More good candidates are functions of the 
form 

_/i \x\<h 



(4a;-l)<=2 J J 4 ^ l-^l _ 2' 

where 61,62, and 63 are positive. We have used a function of the form K^^^^^ in the proof of 
Corollary ^]8| and a function of the form i^^ei,e2,e3 in the proof of Corollary [4.3| . 



4.8 A Lower Bound for A(i) 

We begin with a fundamental relationship between 3?/(l) and 3fJ/(2). 
Lemma 4.15. Let f he a pdf supported on [—\, Then 

2(3?/(l))' - 1 < 3fJ/(2) < 2(3?/(l)) - 1. 
Proof. Since L2{x) := 2cos(27rx) — cos(47ra;) is at least 1 for — ^ < x < ^, we have 

1 < / f{x)L{x) dx=J2 fU)H-j) = 2(3fJ/(l)) - 3f?/(2). 

j=-2 
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Rearranging, we arrive at 3?/(2) < 2(3fJ/(l)) — 1. 

We give two proofs of the inequality 2(3fJ/(l)) — 1 < 3fJ(/(2)), each with its own advan- 
tages. 

First proof: Since /(x) and cos(27rx) are both nonnegative on [— ;|,|], the Cauchy- 
Schwartz inequahty gives 

1/4 \ ^ /.1/4 



2(SR/(1))2 - 1 = 2 / f{x)cos{2-Kx)dx\ - f{x)dx 



1/4 / J-1/4 

"1/4 \ / /•1/4 \ ^1/4 

<2( / f{x)dx] / /(x) (cos(27rx))^ c/x - / f{x)dx 

/-1/4 J \J~1/A J J-1/4 

1/4 pl/i 

/(x) (cos(27rx))^ (ix — / f{x)dx 

1/4 J-l/A 
1/4 

/(x) (2cos^(27rx) - l) dx 

1/4 

1/4 

/(x) cos(47rx) (ix = 3?/(2). 

1/4 

Second proof: Set Lb(x) = 6cos(27rx) — cos(47rx) (with 6 > 0) and observe that for 
— ^ ^ X < ^, we have Lh{x) < 1 + ^. Thus 

r ^ 

1 + f > / f{x)U{x) dx=J2 fij)UH) = b^fil) - 3?/(2). 

i=-2 

Rearranging, we arrive at 5R/(2) > 6(3?/(l)) - 1 - ^. Setting 6 = 43fJ/(l), we find that 
3fJ/(2) > 2(3?/(l))2 - 1. . 

The first proof may be adapted to also give 4(9^/(1))'^ — 33?/(l) < 9^/(3). The proof does 
not immediately extend to higher coefficients. The second proof can be strengthened with 



the additional hypothesis that / be an nif. We take advantage of this in Proposition 4.16 



From the inequality 3fJ/(2) < 29^/(1) — 1 (Lemma [4.15D one easily computes that 
max{|/(l)|, 1/(2)1} > |, and with Lemma [4.11| this gives 

1 ||/*/||oo . / TT 

< — — sin ' 



9- vr \\\f*fl 

This yields ||/*/||oo>l.ll, a non-trivial bound. If one assumes that / is an nif supported 
on a subset of [—\, |] with large measure, then one can do much better than Lemma [4.15| . 
The following proposition establishes the lower bound on A{e) given in Theorem |l.l| (iii). 

Proposition 4.16. Let f be an nif supported on a subset of[—j, ^] with measure e/2. Then 

||/*/||oo > 1.1092 + 0.176158 5 

and consequently 

A{6) > 0.5546£2 + 0.088079£^ 
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Proof. For e > |, this proposition is weaker than Lemma and for e < | it is weaker 



than Proposition |4.12| , so we restrict our attention to | < e < |. 



Let b > —1 be a parameter and set Li,{x) := cos(47rx) — 6cos(27rx). If we define F :- 
max{5R/(l), -3f?/(2)}, then 

j fix)U{x) dx = 3?/(2) - 6gfJ/(l) >-{h+ l)F 



on the one liand, and 



■e/i 

f{x)U{x) dx < I f^\x)U"^'{x) dx= I ^ U'^^'ix) dx 



e 

e/i 

on the other, where L}j^^'^{x) is the symmetric decreasing rearrangement of Kh(x) on the 
■- -1 

4' 4J 



interval [—7,7]- Thus 



1 2 r/^ , 

' b 



b+le 



F > I U"^Hx)dx. 



e/4 



The right-hand side may be computed exphcitly as a function of e and b and then the value 
of b chosen in terms of e to maximize the resulting expression. One finds that for £ < ^, the 
optimal choice of b lies in the interval 2 < 6 < 4, and the resulting lower bound for F is 



^ ^ 3 cos(f ) + sin(f ) - ^3 + 4 cos(f ) + 2 cos(7r£) - sin(f ) 
~ 7r£:cos(^) + 7r£:sin(^) 

From Lemma [4.11| we know that < sin ■ We compare these bounds on F 

to conclude the proof. Specifically, 

< Miii^ f < JL , (« + y - /lu - 1) . (18) 

TT VII/*/lloo/ Stt 127r 

where the expression on the right-hand side of this equation is from the Taylor expansion of 
f sin(^) at xo = |, and 



F^ > 



> 



3 cos(f ) + sin(f ) - V3 + 4 cos(f ) + 2 cos(7r£) - sin(f 
ire cos(^) + ire sin(^) 

-8 (-3 - 72 + 73 + 76) 



(96 (-3-72 + 73 + 76) -4(972-1073 + 76)71) 

37r2 ' ^^^^ 

where the expression on the right-hand side is from the Taylor expansion of the middle 
expression at = |- Comparing Eqs. (|18D and (pJ]) gives a lower bound on ||/ * / ||oo, say 
11/ * /lloo > Ci + C2£ with certain constants Ci,C2. It is easily checked that Ci > 1.1092 and 
C2 > 0.176158, concluding the proof of the first asserted inequahty. The second inequahty 
then follows from Lemma |4.1| . ■ 
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5 Upper Bounds for R{g, n) Arising from A(£) 



5.1 Inequalities Relating A(£) and R{g^n) 

A symmetric set consists of pairs {x,y) all with a fixed midpoint c = If there are few 
pairs in E X E with a given sum 2c, then there will be no large symmetric subset of E 
with center c. We take advantage of the constructions of large integer sets whose pairwise 
sums repeat at most g times to construct large real subsets of [0, 1) with no large symmetric 
subsets. Recall the definition (^) of the function R{g,n). 

Proposition 5.1. For any integers n > g > 1, we have A ( '^^^'"^ ) < ^. 
Proof. Let C {1, 2, . . . , n} be a B*[g] set with \S\ = R{g,n). Let 



n n 



as in Eq. (^); it suffices to show that the largest symmetric subset of ^4(5*) has measure 
at most -. Notice that the set ^4(5) is a finite union of intervals, and so the function 
A(y4(S')n(2c — ^4(5"))), which gives the measure of the largest symmetric subset of ^(5') with 
center c, is piecewise hnear. (Figure contains a typical example of the set ^(5") portrayed 
in dark gray below the c-axis, together with the function \ (^A{S) fl (2c — A(S'))) shown as the 
upper boundary of the light gray region above the c-axis, for S' = {1, 2, 3, 5, 8, 13}.) Without 
loss of generality, therefore, we may restrict our attention to those symmetric subsets of A{S) 
whose center c is the midpoint of endpoints of any two intervals (^^, ^). In other words, 
we may assume that 2nc G Z. 



\{A{S) n {2c-A{S))) 




Figure 5: A{S), and the function \{A{S) n (2c - A{S))), with S = {1, 2, 3, 5, 8, 13} 



Sl 



Suppose u and v are elements of ^4(5) such that = c. Write u 
V = ^ — ^ + y for integers si,S2 G S and real numbers x,y satisfying \y\ ^ 2„- 
may ignore the possibility that nu or nv is an integer, since this is a measure- zero event for 



2^ + a; and 
< ^. (We 
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any fixed c.) Then 2nc = n{u + f ) = Si + S2 — 1 + n{x + y), and since 2nc, Si, and S2 are 
all integers, we see that n{x + y) is also an integer. But \n{x + y)| < 1, so x + ?/ = and 
Si + S2 = 2nc + 1. 

Since S* is a B*[g\ set, there are at most g solutions (si,S2) to the equation si + S2 = 
2nc+ 1. If it happens that si = S2, the interval (^^, 77) (a set of measure ^) is contributed 
to the symmetric subset with center c. Otherwise, the set {^^^, ^) U (^^^, ^) (a set of 
measure -) is contributed to the symmetric subset with center c, but this counts for the two 
solutions (si, S2) and (s2,Si). In total, then, the largest symmetric subset having center c 
has measure at most -. This establishes the theorem. ■ 

n 

Using Proposition |5]l|, we can translate lower bounds on A^e) into upper bounds on 
R{g,n), as in Corollary |5.2| . 

Corollary 5.2. If 6 < info<e<i A(£)/e^, then R{g,n) < S~^^'^^/gn for all n> g >1. 



We remark that we may take 5 = 0.591389 by Proposition |4.6| , and so this corollary 
implies that R{g,n) < 1.30036^^^, which is one of the assertions of Theorem |1.4 



Proof. Combining the hypothesized lower bound A.(e) > Se with Proposition |5.1| , we find 
that 



2 

n 



^I^ R{g,n) y ^^I^ R{g,n) ^^ 

which is equivalent to R{g, n) < 5~'^/'^^/gn. 

We have been unable to prove or disprove that 



lim lim — — ^ = | inf 



g^oon—*oo ^ gTL V0<£<1 



-1/2 



i.e., that Corollary |5.2| is best possible as (7 — 00. At any rate, for small g it is possible to do 
better by taking advantage of the shape of the set A{S^ used in the proof of Proposition ^ 
This is the subject of Section p^ . 



Proposition |5.1| provides a one-sided inequality linking A{e) and R{g,n). It will also 
be useful for us to prove a theoretical result showing that the problems of determining 
the asymptotics of the two functions are, in a weak sense, equivalent. In particular, the 
following proposition implies that the trivial lower bound A{e) > ^e^ and the trivial upper 
bound R{g, n) < ^/2gn are actually equivalent. Further, any nontrivial lower bound on A{e) 
gives a nontrivial upper bound on R{g, n), and vice versa. 

Proposition 5.3. A{e) = inf{f : n > ^ > 1, > e} for all < e < 1. 

Proof. That A{e) is bounded above by the right-hand side follows immediately from Propo- 
sition |5.1| and the fact that A is an increasing function. For the complementary inequality, 
let S C [0, 1) with X{S) = e. There exists a finite union T of open intervals such that 
X{S o T) < rj, and it is easily seen that T can be chosen to meet the following criteria: 
T C [0, 1), the endpoints of the finitely many intervals comprising T are rational, and 
A(T) > 6. Choosing a common denominator n for the endpoints of the intervals comprising 
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T, we may write T = IJmeAf [^^^' ^) (^P ^'^ ^ finite set of points) for some set of integers 
M C {1, . . . ,n}; most likely we have greatly increased the number of intervals comprising 
T by writing it in this manner, and M contains many consecutive integers. Let g be the 
maximal number of solutions (mi, G M x M to mi + 1712 = k a.s k varies over all inte- 
gers, so that M is a B*[g] set and thus \M\ < R{g, n) by the definition of R. It follows that 
£ < A(T) = \M\/n < R{g, n)/n. Now T is exactly the set A{M) as defined in Eq. (^); hence 
D{T) = ^ as we saw in the proof of Proposition Therefore by Lemma |3.4] , 

D{S) > D{T) -2r] = f^-2r]> inf{f ■.n>g>l, > e} - 2r]. 

Taking the infimum over appropriate sets 5* and noting that rj > was arbitrary, we derive 
the desired inequality A(e) > inf{^ '■ n > g > 1, > g 



5.2 Upper Bounds on R{g^ n) and p{g) 



The bulk of Theorem |L2| follows immediately from Proposition which is proved in this 
section. 

Let S' C {1, 2, . . . , n} be a B*[g] set. We call the function 



fix) :-- 



2n s-1 1 



— ^ < X < ^ — T-for some s E S, 



|5|' 2n 4 — 2n 4 

0, otherwise 



the nif corresponding to 5*. We think of 5*, and consequently /, as depending on n (many 
readers may prefer that we write S'„ and /„, but this is neither customary nor sufficiently 
brief). For example, when we write "Let S* C {1, 2, . . . , n} be a B*[g] set with l^l > 0.7 ^/gn'^ , 
we mean "For each n G Z"*", let S = Sn C {1, 2, . . . , n} be a B*[g] set with l^l = l^nl > 
0.7^." 

Note that / is piecewise constant, so that / * / is piecewise linear. Furthermore, the 
corners of / * / are only at the points ^Z, and if we define 

r{t):=i^{{x,y)eS':x + y = t}, (20) 

then / * f{^) = In particular, ^ is a B*[g] set if and only if ||/ * f\\^ < 

Proposition 5.4. Let cf) := 1.14915 he the constant appearing in Corollary and let 
9q := 21.922911, 9i := —33.711941, and 62 '■= 13.676987 he the constants appearing in 
Lemma \4-i!!\ - Then: 

i. R{g, n) < ■\/2/(/)y' {g — l)n + | for n > g > 2 and g odd; 
ii- l>{.g) ^ a/2/0 f 1 — for g > 2, and if moreover 'p{g) > 1.275237 then 



2v/3% 

Pi9) < / , yi- -- 

(3(1 - ^1)^ - - -^(3(1 - e^)g - v^) - ?>Qeoe2g''\ 
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■ //fi' > 2 is odd, then p{g) < a/2/0 ^1 — j ; o^nd if moreover p{g) > 1.275237, 

2x/3^ 



III 

then 



Pi9)< 



(^3{l-9^)g-V3-3-^J{3{l-9^)g-V3-3Y~ 36^0^2^?^) 



1/2' 



We begin with a simple lemma. 

Lemma 5.5. Ifh,p,q are nonnegative functions with h = p + q and \\h\\oo > 
then 

> II^II^-IHI^ I 

— II 7 II , II II "I" ILr I oo- 

' 1 + Iblli 



Proof. We have 



2 II I l|2 
2 = Ib + ?ll2 



2 + ll(2p + g)g||i 



< \\p\\l + \\2p + qUq\U 

= lbll2 + (ll^lli + lblli)lklloo 

<lbll2 + (l|/^lli + lblli)(l|/^lloo-lblloo). 



We will use this lemma with /i = /*/(/ a pdf), and p chosen so that ||j9||i 0, 
Ibib — > 0, IIpIIoo 7^ 0. In this case, we have the inequality 

ll/*/l|oo>||/*/l|^+|b||oo 

which is stronger than Holder's Inequality: ||/ * /||oo > ||/ * /||2- 

Proof of Proposition 5^(§j. The idea of the proof is that even though / * / might take 



values near it does so only on a set of small measure, and away from that small set 

it is bounded by ^^^gp''" - If the pair (si,S2) contributes to r{k), then so does (s2,si), and 
therefore r{k) is odd if and only if /c G {2s: s G S}. There are only l^l such integers k, 
and no two are consecutive. Consequently, if / * /(^) = then / * /(^^) < and 

/*/(l^)<^- 

We put this idea into effect by writing / * /(x) = p{x) + q{x), where p represents the 
small contribution to r{k) of pairs (s, s) and q is the remaining majority of / * /. More 
precisely, let p{x) := J^ses'^i-'^ ~ ^ ~ i)' where T(x) is the "tent function" 



T{x) 



|^(l-2n|x|), ii\x\<^, 
0, otherwise, 



and let q{x) := f * f{x) — p{x). (See Figure || for an illustration of a typical example, again 
using the 5* [3] set S = {1, 2, 3, 5, 8, 13}.) Because of our judicious choice of the peaks of p, 
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Figure 6: The decomposition / * /(x) 
S = {1,2,3,5,8,13} 



p{x) + q{x), where / is the nif corresponding to 



both p and q are nonnegative and ||/*/||oo > ||'?||oo + lbl|oo- We compute directly from the 



definition of p that = |^|||T||] 



Lemma [5.5| with h = f * f gives 

ll/*/l|oo> 



2\S\-' 



|5|||T||H7i^,and |b||oo = ||T|| 



_ _2n_ 

oo — |5|2 



1 + 



(21) 



Using the inequahty ||/ * /II2 > from Corollary |4l7| , the inequality || / * / ||oo < j§^, 



and the above computations of 



l5 \\P\\2, 



and 



we can deduce from Eq. (^) that 



\S\'< 



2{g-l)n {g - l)n + 2n/3 2{g - l)n gn 



+ 



< 



from which it follows that 15*1 < 



2(3-l)n , 1 
d> 3" 



Lemma 5.6. // / is the nif corresponding to S* C {1,2, . . . ,n}, and m' —>■ 00 with m' 
' 



\j\>m' 



> 



2 n 



-0(1). 
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Proof. Note that for j ^0, 

Jf{j)=jj f{x)e-'-^^^dx 



s 1 



1 \ /"an 4 

^ ' /s-l 1 

S&S '^-2^~4 

J2 (^e-^'^^^^^-i) - e-'^'^(^-3)) . 



ses 



2m 

In particular, if we set c(j) := j then c(j) = c(j + 2n) (provided neither j nor 

J + 2n is zero) and c(j) = c{—j). Exploiting this periodicity is the heart of this lemma. 
We define 

J :={m' + l,m' + 2, . . . ,n} U {2n - m' ,2n - m' + 1, . . . ,2n - 1} 
f :=[m' + l,m' + 2n]\ J, 

and for j G J we set 

{2n — j, m' + 1 < j < n; 

2n, j = n; 

An — j, 2n — m' < j < 2n. 

Observe that c(j) — c{j') and J' — {j' : j e J}. Also, for almost all j e J we have / = 2n—j, 
since m' = o{n). 
For all p > 1, 

oo 

|j|>m' j=m'+l 



2 E 

j=m'+l 



m'+2n oo / / .\ 

c(j) 



= 2 E E 

j=m'+l fc=0 
m'+2n 



j + 2n-k 



= 2 E f c(P.i.) 

j=m'+l ^ ^ 

= 2E(l^)'(C(P'4) + C(p,fe)), (22) 



where a) := X]fclo(^ + ^)~* Hurwitz zeta function. 

We shall use Cauchy's inequahty in the form > (^^A) ^ -^th 



2n , 
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C(4,i) + C(4,£)) 



and 



, C(2,|,) + C(2,£) 

~ / .X 1/2 • 



(C(4,4) + C(4,£)) 
For notational convenience set 



^ ' ^ 2n^ ^ 2n^ CiA ^) + C(4: ^' 



2 



and note that since m' = o{n), 

lim Z(m',n) = lim — V + 7^.)) 



"^(C(2.aHC(2.1-a))^,^_ T V^da=^, (23) 



„ C(4,a)+C(4,l-a) Jo 2 + cos(2ia) 2 

Therefore from Eq. (^) with g = 4 and Cauchy's inequahty, 



(E,« (*)'(C(2.i)+C(2.|;)))' 



\j\>m' jeJ 



E,eJ (C(2, 4) + C(2, £))'(C(4, ^) + C(4, £))-' 



2nZ{m', n) 
{\\m-Eyi<m'\Hj)\' 



(24) 



4n Z{Tn', n) 

the last equahty following from Parseval's formula. Trivially, < /(O) = 1, and since 

2n 
151- 



/ is the nif corresponding to S, we have ||/||^ = ||/||^A(supp(/)) = ||/||oo = tu\- Since 



m' = o{n/\S\), Eq. becomes 



I frilly 1 liZ^lllO 2 n 



Proof of Proposition |5!J(|z^. Let S* C {1, 2, . . . , n} be a B*[g\ set with 15*1 = R{g, n) and let 
/ be the corresponding nif. We shall use the inequality 

i \j\<m' \j\>m' 
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We use Proposition |]7 and Proposition O to bound the sum over small and we use 
Lemma ^]6| to bound the sum over large \j \ . 



We are now prepared to prove Proposition p.4|(P). We have 



2 2nn 
^ = hminf ^ > hminf ||/ * /lU > hminf ||/ * f\\l 

p [g J n^oo \o\ n^oo n—>oo 



liminfE|/(j)r = liminf f 

j ^ \j\<'m' \j\>m' ^ 



We see from Corollary and Lemma that 
2 ,..„/, 2 n 



/ An \ 

alimijf (0-0(1) + -^^-0(1))=^ + 



1 



Solving this inequality for p{g) yields 

/2 / 1 
p(^?)< W-l 1- 



9VS 



1/2 



If p{9) > 1.275237, then ||/ * f\\^ < 1.229837 for infinitely many n, and we can use 
Lemma [4.13| instead of Corollary [4.7| . Setting 



F := liminf ||/ * f\\^ < liminf 



2gn 



we have 



F> (hminf J2 \fijr+ E 

\j\<m' \j\>^' 

> hminf k + ^ill/ * /Hoc + ^211/ * fill - o(l) + /- I 



2 n 



oil' 



>eo + OiF + e2F^ 



whence 



Pi9)<\lp< 



(^3(1 - - V3 - y/(3(l-^^i)(7-V3) '-36^0^2^72^ 



1/2- 



Proof of Proposition |5.^ ( |m^ / We combine the ideas of parts (|) and As in the proof of 
Proposition ^]^(||), we let S' C {1, 2, . . . , n} be a B*[g] set with \S\ = R{g,n) and / be the 
corresponding nif. 

If g is odd, then (defining p and q as in the proof of Proposition |5.4| (|D) Eq. (^I]) is valid, 
and IIpIIi — > 0, ||p||2 0, and 



2n 

\co — Jsp 



2 
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We find 



2 2gn 
lim 



J>\g) ™ \S\^ 

= lim II/* /lie 



> lim 



ii/*/iinbii^ 



n— +00 1 -|- 

(E|,i<™'l/(j)l' + E|,wl/(j)r)-lblli 



lim 



i + INI 



TTpk + """^ 

21 2 
+ ^=0^ + 



1 + 1/V3 



Solving this inequality for ]f'{g) yields 



p(^7)<a/- 1- 



2 / 1 + 1/V3\ 



1/2 



V ■ 



If p{.g) > 1.275237, then ||/ * /||oo < 1.229837 for infinitely many n, and we can use 
Lemma [4.13| . We have: 



2 _ 2n/3 







II/* /II. 



E \f(j)\' >Oo + 0i\\f*f\\oo + e2\\f* fill - 0(1) 

\j\<m' 

Thus, again writing F = liminf„^oo 11/ * /|| 00, we now know 
F:=liminf ||/*/|U 

n—>oo 

(E|,|wl/(j)r + E|,wl/(j)r)-IHIi 

> lim inf — h 

n— »oo 1 -)- 

> ^0 + OiF + e^F^ + J— + -. 

gVs 9 
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Isolating F, we obtain 



F > 



-200 



and so 



~ / I 5 

f3(l-^i)(7- y^(3(l-0i)(7- V3-3) -36^0^2^?^ 



which concludes the proof. 



5.3 Ubiquity of Repeated Sums in B*[g\ Sets 



The method of proof of Proposition |5.4| (pD can be adapted to yield more information about 
the number of representations of integers as sums of pairs of elements from B* [g\ sets. The 
following theorem gives a quantitative statement of the fact that, if S" is a dense enough 
B*[g\ set, then there is a substantial number of integers t such that r(t) is large, where r{t) 
is defined in Eq. (pO|). 

Theorem 5.7. Let 5 C {1, . . . , n} he a B*[g\ set, and let < L < g be a real number. The 
number of integers t such that r{t) > L is at least 

(j)\S\^ - 2Ln\S\^ 
n{g-L){g + 2L)' 



where (j) := 1.14915 is the constant appearing in Corollary |^. 7. 

We give an illustration of this theorem after the proof. 

Proof. Let / be the nif corresponding to S. We note again that || / * / ||i = ||/||i = 1 and 
11/ */||2>0 by Corollary 0. 

Now let Q be the number of integers t between 1 and 2n such that r{t) > L. Our aim 
is to show that Q is large if IS"! is large enough and L is small enough. Define the functions 
p{x) and q{x) by 

q{x) = min |/ * /(x), ^| , p{x) = f * f{x) - q{x). 



We have 



< 11/ * /ll^ = lb + = Ibll^ + ||(2p + g)g||i 

< Ibll2 + l|2p + g||i||g||oo 
= lbll2 + (i + lblli)lklloo, (25) 
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where the latter inequality is by Holder. By its definition, we have ||g||oo < ^fjt- To bound 



1 and ||p||2, we look more closely at the function p. 
Suppose r(i(:+l), r(t + 2), . . . , r{t+K) are all greater than L, but that r{t) and r{t+K+l) 
are at most L. Then the graph of / * / between and lies under the trapezoid 

with vertices and The area of this 

trapezoid, which bounds the contribution to ||p||i from x in the interval [ *"^^''' ; *"^^~'^ ] ; 
equals "^^j^- Since the numbers K from all such intervals sum to Q, we have established 
the bound IIpIIi < 

Similarly, the graph of (/ * fY between ^~2~^ and lies under the trapezoid with 

vertices (i^, ^), (^, ^), (^±^, ^), and f^, ^) (since the graph of 
/*/ is piecewise convex). The area of this trapezoid, which bounds the contribution to ||p||2 
from X in the interval equals ^""^u^ ' ■ Since the numbers K from all such 

2 / Qn{g^-L^) 



\2 U„-(-„,„„v. t-n-1 t+K-n 



intervals sum to Q, we have established the bound \\p\\\ < 
Inserting these bounds into Eq. (|25|), we conclude that 



|5|4 



or equivalently 

as desired. ■ 
We remark that, using the techniques from the proof of Proposition |5.4| (P), the constant 



1.14915 in the statement of Theorem |5.7| can be replaced with 1.182778 (provided one is 
concerned only with the case n — >■ oo). 

To give an illustration of this theorem, define 7 = \S\/y/gn, a = L/g, and k = Q/2n. 
Then the inequality (^) becomes 

7^(0.5745757^ -g) 
- (l-a)(l + 2a) ^ ^ 

For example, take 7 = 0.7 and a = 0.25, so that the right-hand side of this inequality is 
greater than 0.0137382 > ^. Theorem |1.3| tells us that for every g > 2 except g = 3 and 
possibly g = 5 and g = 7, there exists a B*[g] set S contained in {1, . . . ,n} with at least 
0.7y^^ elements, at least when n is large. For every such set S, the inequality ( p7|) asserts 
that at least || of the integers t between 1 and 2n have at least | representations t = si + S2 
with Si,S2 G S. 

To give a basis for comparison for Theorem [5.7| , we note that the simple argument 
\S\' = J2^it)= E rit)<i2n~Q)L + Qg 

t t: r(t)<L t: r{t)>L 

gives Q > ^'^gl^"'^ , which translates in the above notation into 
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Figure 7: Comparing the bounds of Eq. P5|) and Eq. (P?]) 

In Figure Region IV corresponds to the pairs (7, a) for which neither Eq. (|27D nor Eq. ( ^Sf ) 
gives a nontrivial bound on k (the trivial bound is k > 0). In Region I, which contains our 
example point (0.7, 0.25), Eq. ( ^7]) gives a nontrivial bound while Eq. (|28|) does not. Eq. ( pSD 
gives a nontrivial bound in Regions II and III, but in Region II the bound in Eq. ( pTj) is 
better. That is, the simple bound in Eq. ( pSD is better than Eq. ( pT)) only in Region III. 



5.4 The Uniform Distribution Hypothesis 



We say that a sequence (5'„)^i of sets of positive integers becomes uniformly distributed if 
the discrepancy of Sn goes to as ri — 00. That is, (S'„)^^ becomes uniformly distributed 
if 

(/3 - a) 



lim sup 



sup 

0<a</3<l 



Sn 



0, 



where M„ denotes the largest element of Sn- 

It has long been known |pi)rd44| , |Cho44|| that R{2,n) ~ ^/n. In 1991, Erdos and 
Freud [[EF91|| proved that if C {1, . . . , n} is a sequence of B*[2] sets with \Sn\ ~ \/n, 
then (Sn) becomes uniformly distributed. We are led to make the following conjecture. 

Conjecture 5.8. Let g > 2 be an integer. Suppose that C {1, . . . , n} zs a sequence of 
B*[g] sets with \Sn\ ^ R{g,n). Then (S'„) becomes uniformly distributed. 

Independent of the conjecture, we are able to prove a strong result on the cardinality of 
uniformly distributed B*[g] sets. 

Theorem 5.9. Let g > 2 be an integer. Let Sn ^ {1, . . . ,n} be a sequence of B*[g] sets that 
becomes uniformly distributed as n —>■ 00. Then \Sn\ < 1.15988,/^. 
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Proof. Let /„ be the nif corresponding to S'„. Since the Sn become uniformly distributed, the 
functions /„ converge in measure to the function that is identically 2 on [— |, |]. Therefore 

^1/4 2 
J-l/i 

Now, apply Lemma |4.11| to find liminf^ \\.fn*.fn\\oo > 1.486634. Since ||/„*/„||oo = 

we have shown |S'„| < 1.15988y^^. ■ 



In light of Theorem p.9| , we see that Conjecture |5]^ implies that p{g) < 1.15988, improv- 
ing Theorem |1.2| for all sufficiently large values of g. 
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6 Constructions of B*[g\ Sets and Lower Bounds for 

R[g, n) 

We begin by considering a modular version of B*[g\ sets. A set is a B*[g\ (mod n) set 
if for any given m there are at most g ordered pairs (si,S2) & S x S with si + S2 = m 
(mod 77.) (equivalently, if the coefficients of the least-degree representative of (Xlnes'^'^)^ 
(mod z'^ — 1) are bounded by g). For example, the set {0, 1, 2, 4} is a 5* [3] (mod 7) set, and 
{0, 1, 3, 7} is a B*[2\ (mod 12) set. Note that 7 + 7=1 + 1 (mod 12), so that {0, 1, 3, 7} 
is not a "modular Sidon set" as defined by some authors, e.g., |pS80|| or [Puy94] , Problem 
CIO]. It is, however, the natural companion to the study of B*[g] sets, as evidenced by the 
clean forms of Propositions and ^]9| below. 

Just as we defined R{g,n) to be the largest possible cardinality of a B*[g] set contained 
in [0,n), we define C{g,n) to be the largest possible cardinality of a B*[g] (mod n) set. 
The mnemonic is "R" for the Meal problem and "C" for the Circular problem. After ex- 
hibiting some basic bounds for this new function C{g,n) in the next section, we construct 



some explicit families of large B* [g] (mod n) sets in Section |6.2| , which are used in turn to 



construct large B* [g] sets themselves in Section |6]^. We also demonstrate the existence of 



large B*[g] (mod n) sets via a probabilistic construction in Section |6.3| , and we give a similar 



probabilistic construction of large B* [g] sets in Section |6.5| . These constructions will yield 



lower bounds on the size of B*[g] sets, which we collect in Section 3.6 



C{g,n 




6.1 Upper Bounds on C{g,n) 

Since B* [g] (mod n) sets have not been rigorously developed in the literature, we begin this 
section by giving some simple upper hounds on C{g,n). 
We can obtain 

g even, 
/gn +1 g odd, 

from the pigeonhole principle as follows. There are 15*^ pairs of elements from S, and there 
are just n possible values for the sum of two elements, and a possible value is realized at 
most g times. Thus IS"! < ^/gn. The only way a sum can occur an odd number of times is 
if it is twice an element of S, so for odd g, l^p < {g — l)n + \S\. 
For g = 2 we can be more precise: 

{2) ^ Lt J • To estabhsh this, let S C [0, n) be a 
witness and note that there are ('2') pairs of distinct elements from S, and each such pair 
si, S2 leads to a pair of differences {si — S2, S2 — si} G {{i, n — i}: 1 < i < n}. If each of 
(si, S2) and (s3, S4) is a pair of incongruent elements and if si — S2 = S3 — S4 (mod n), then 
Si + S4 = S4 + Si = S2 + S3 = S3 + S2 (mod n). The fact that 5* is a B*[2] (mod n) set forces 
{51,54} = {^2, S3} (mod n), and since Si ^ S2 (mod n) by assumption we conclude that 
Si = S3 (mod n) and S2 = S4 (mod n). Therefore distinct pairs of incongruent elements lead 
to distinct sets of differences, of which there are at most [n/2j , establishing (^^2'"^) ^ Lf J • 
This bound is actually achieved for n = p"^ + p + 1 when p is prime (see Proposition |6.1| (iii) 
below) and implies that C(2,n) < y/n + 1. 
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6.2 Explicit Constructions of B*[g\ (mod n) Sets 

We turn now to the problem of constructing large B* [g\ (mod n) sets. The literature contains 
several examples of families of B*[2\ (mod n) sets which we can generalize to families of 
B*[g\ (mod ra) sets. The following proposition collects several lower bounds for C{g,x) 
corresponding to various constructions given in the proofs thereafter. 

Proposition 6.1. Let p be a prime power, and let 1 < k < p. 

i. If p is a prime, then C{2k'^,p^ — p) > k{p — 1); 
zz. C(2fc2,p2 -i)> kp; 

lii. C{2k^, p^ +P + 1) > kp + 1. 



The k = 1 cases of Proposition |0| (i) , (ii), and (iii) are due to Ruzsa |[Ruz93|| , Bose Pos42|| , 



and Singer [ 3in3^| , respectively. Part (i) uses the existence of a primitive root modulo a prime 



p and the Chinese Remainder Theorem (the fact that the orders of the additive and multi- 
plicative groups modulo p are relatively prime is important, which is why the construction 
does not generalize to all finite fields). Part (ii) uses the existence of the finite fields GF(p*) 
with cyclic multiplicative groups and their vector space structures. The third part also uses 
a fact (Lemma |6.3D connecting the multiplicative group of a finite field to its vector space 
structure. 

All three constructions make use of the following lemma. Although the lemma is a special 
case of unique factorization, we give here a simple, elementary proof of the special case that 
we require. 

Lemma 6.2. In any field, there are at most 2 ordered pairs {a,b) of solutions to the poly- 
nomial equation — Cix + C2 = (x — a)(x — 6). 

Proof. Suppose that there are three solutions (am,&m), 1 < m < 3. Obviously we have 
ai + 6i = Ci = 02 + 1)2 and aibi = C2 = ci2&2- This leads to = ai(ai + hi — a2 — 62) = 
al + Oifei — aia2 — ai&2 = al + 02^2 — 0-102 — 01^2 = (oi — 02)(oi — ^2), whence ai G {02, &2}- 
If Oi = 02, then bi = 62 and the three solutions are not distinct. Otherwise ai = b2, and 
likewise ai = 63, whence &2 = again a contradiction. ■ 



Proof of Proposition \6.I\ fi). Let ghea primitive root modulo p. Using the Chinese Remainder 



Theorem, define at^i for 1 < t < p and 1 < z < A; by the pair of congruences 

ttt^i = t (mod J5 — 1) and a^^j = ig^ (mod p). (29) 

Set Si := {at^i : 1 < t < p}; clearly l^il = p — 1. we shall show that IJi=i 'S'j witnesses 
C{2k'^,p{p-1)) > k{p-l). 

Suppose ami,i = cim2,j^ with mi,m2 G [l,p). We have mi = ami,i = CLm2,j = '^2 (mod p — 
1), so mi = m2. Reducing the equation 0^1,1 = 0,^2 ,j modulo p, we find ig"^^ = jg"^'^ = jg"^^ 

(mod p), so i = j. Thus Si and Sj are distinct for distinct z, j, and IJi=i Si = k{p — 1). 

First assume that, given any z, j G [1, k] and any n G [l,p(p — 1)], there are at most two 
pairs (r, v) & Si x Sj with r + v = n (mod p{p — !))■ Since there are fc^ such choices of i,j, 
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and each such choice leads to at most two pairs with a given sum, this shows that each n 
arises as the sum of at most pairs, concluding the proof. 

Now suppose if possible that there are three pairs (a^^^j, a^,^ E Si x Sj satisfying 
ar„,i + CLvrr^j = (mod p{p — 1)). Then obviously ar^^^i + dv^j = n (mod p). Also, since 
ar^,i + a^^j = n (mod p - 1), we have a^..^,, ■ a^,„j = ig^'^jg""' = ijg''^+'"'^ = ijg"" (mod p). 
Thus the pairs (a^^^j, av,n,j) solutions to x"^ — nx + ijg^ (mod p). Applying Lemma |0| , 
we find that two of the pairs (ctr^.i, c^i;™ j) are equal, say a^j^j = a,.2,j (mod p). By Eq. (PP|), 
2(y(''i = Oj-i^i = 0^2,1 = ^5''^^ (mod p). Since (7 has order p—l, this tells us that a^i^j = ri = r2 = 
0^2,1 (modp— 1). Since Ori^j = aj.2,i (mod p), aj.^,, = aj.2,i (mod p — 1), and Of^i G [l,p(p— 1)) 
(by definition), we see that j, whence the three pairs are not distinct. ■ 

Proof of Proposition ]^.]\ (ii). Let 6 generate the multiplicative group of the finite field GF(p^), 
and observe that {1,6'} is a basis of GF(p^) as a vector space over GF(p). For i G GF(p), 
define 

Si := {s' G - 1] : r' = 20 + s, sE GF{p)}. 

Bose ||Bos42|| showed that each Si is a Sidon set. we shall show that S = IJi=i witnesses 
C(2A;2,p2_ 1) > 

First, note that for each s G GF{p), there is an integer s' G — 1] with 6^ = i6 + s, 
so that \Si\ = p. Since 1, is a basis for GF(j9^) over GF(p), we know that i9 + Si = j9 + S2 
implies that i = j and si = S2. In particular, if z ^ j, then 5*1 fl Sj = 0. Thus | IJ^^i = ^P- 

It is sufficient to show that for each n and any i,j (for which there are fc^ choices) there 
are at most 2 pairs (r.^, v'^) in 5*4 x S'j with r'^ + v'^ = n (mod — 1). Define ci, C2 G GF(p) 
by {ij)~^0" — 9"^ = ciO + C2. Since + t;^ = n (mod p^ — 1), we have 

c^e + C2 = (ijT^r - 02 ^ _ ^2 ^ (ij)-!^-^''- - 02 = 

This means that (a, 6) = (^^^'^r^, j~^t>m) is a solution to a;^ — cix + C2 = {x — a){x — b). By 
Lemma |6.2| , there are at most two such pairs. ■ 

To extend Singer's ||Sin38|| construction, we shall need the following lemma. 

Lemma 6.3. Let p be a prime power, and let 6 G GF{p^) generate the multiplicative group. 
Then 0" and 0* are linearly dependent over GF{p) iff a = b (mod p'^ + p + 1). 

Proof. The multiplicative group of GF(p) is a subgroup of the multiplicative group of GF(p^), 
i.e., GF(p) = {0^^ : A; = (mod ^^^)}- Since two elements of GF(p'^) are linearly dependent 
over GF(p) exactly if their ratio is in GF(p), we see that 0° and 0^* are linearly dependent 
exactly if a — 6 = (mod ^^f^-)- ■ 

Proof of Proposition \6^ (iii). Let G GF(p^) generate the multiplicative group. Since is 
algebraic with degree 3 over GF{p), the elements 1,0, 0^ are a basis for GF(j9^) over GF{p). 
Define 

Ti := {0} U {s' G [l,p^ - 1] : 9'' = te + s, s G GF(j9)} 

for 1 < i < A;, and define Si to be the set of congruence classes modulo q := = p"^ +p+l 
which intersect T,-. 
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Since each nonzero s' G Tj corresponds to a unique s G GF(p), and for each s G GF(p) 
there is an s' G [l,p^ — 1] with 9^ = iO + s, we see that |Tj| = |GF(p) \ + 1 = p + 1, and so 
by virtue of Lemma |673| , |5'j| = p + 1. Also, each s' 7^ occurs in at most one of the Tj, so 
that I ULi Si\ = kp + I. 

We wish to show that, given any n, there are at most two pairs {r',v') & Th x Tj with 
r' + v' = n mod q. Since there are k'^ choices of h,j, this will establish that IJi=i witnesses 
C(2/c2,p2 + p + 1) > kp + 1. Define Lq = 1, and for each x' G Tj set L^/ = 6''''' = iO + x. 

Suppose that n is fixed, and {r'^yv'^) G T/j x T,- (1 < m < 3) are distinct pairs 
with r'^ + v'^ = n mod q for 1 < m < 3. It follows from Lemma ^]3| that each pair of 
L^i^Lyi^, Lr'^Lyi^, Lj,i^Lyi^ 316 llucarly dependent, i.e., they are multiples of each other. If r[ = 0, 
then Lr'^Lyi^ is linear. This means that L^'^L^i^ is also linear, and so one of r2,f2 is zero and 
the other is equal to v'^ G Tj. If r'2 = 0, then {r[,v[) = {r'2,v'2), contradicting distinctness, 
and if = 0, then v'l = r'2 (mod g), which is only possible if v'^ = = 0, whence again 

= ij'2,v'2). 

Thus each L^i L.,i is a quadratic in 9. The coefficient of 9"^ in each ^" = L^i L„i = 
{h9+rm){j9+Vm) = hj9'^ + {rmj +Vmh)9+rmVm is hj. Since the Lr'^L^i^ are multiples of each 
other with the same lead coefficients, they must in fact be equal. Set ci, C2 by {hj)~^9'"' — 9^ = 
Ci9+C2, and observe that (a, 6) = {h~^rm, j~^Vm) is a solution to a;^ — cix + C2 = {x — a){x — b); 



hence, by Lemma 3.2, there are only two such pairs. 



We now show how to combine two B*[g] (mod n) sets to construct another. 

Proposition 6.4. Let gcd{x,y) = 1, and let S be a B*[g] (mod x) set and M be a B*[h] 
(mod y) set. Then the set M + yS := {m + ys: m E M, s E S} is a B*[gh] (mod xy) set. In 
particular, C{gh, xy) > C{g, x)C{h, y) for any positive integers g, h, x, y with gcd(x, y) = 1. 

Proof. Consider mi,ni E M and Si,ti G 5* with 

(mi + ysi) + {ni + yti) = ■■■ = {rugh+i + ysgh+i) + (%/i+i + ytgh+i) (mod xy). (30) 

We need to show that mi = nij, Si = Sj, = uj, and tj = tj, for some i,j. Reducing these 
Eq. ( ppp modulo y, we see that mi + ni = m2 + n2 = ■ ■ ■ = rng^^i + Ug^^i (mod y). Since 
M is a B*[h] (mod y) set, we can reorder the mi,ni, Si,ti so that mi = m2 = ■ ■ ■ = m^+i 
and ni = n2 = ■ ■ ■ = %+i. Reducing Eq. ( ^0|) modulo x we arrive at 

ysi + yti = ys2 + yt2 = ■ ■ ■ = ysg+i + ytg+i (mod x) 

whence, since gcd(x, = 1, 

si+ti = S2 + t2 = ■ ■ ■ = Sg+i + tg^i (mod x). 

The Si and ti are from a B*[g] (mod x) set, so that for some i, j, Si = Sj and = tj. m 

We have computed C{g,n) for small and n by exhaustive search. The results are 
summarized in Table |l]. The entry for {k,g) = (10,5) is 28; this means that C(5,28) > 10 
(a witness is {0, 1, 2, 4, 5, 8, 12, 15, 23, 24}), while there is no n < 28 for which C(5, n) > 10. 

It is straightforward to verify that C(5,28) > 10: one simply verifies that the witness 
has 10 elements and is indeed a B*[5] (mod 28) set. It is not straightforward, however, to 
verify that C{5,n) < 10 for n < 28. We have made these verifications for each entry given 
in the table by a long computer search using Mathematica. 
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Table 1: min{n: C{g,n) > k} 



6.3 Probabilistic Constructions of B'^[g] (mod n) Sets 

The algebraic methods of the previous section provide effective and completely explicit con- 
structions of large B*[g] (mod n) sets. However, we can establish the existence of even 
larger B*[g] (mod n) sets using a probabilistic construction. We rely upon the following two 
lemmas, which are quantitative statements of the Law of Large Numbers for sums of many 
independent random variables. 

Lemma 6.5. Let pi, . . . ,Pn be real numbers in the range [0, 1], and set p = {pi + - ■ ■ + p„,)/n. 
Define mutually independent random variables Xi, . . . , X„ such that Xi takes the value l—pi 
with probability Pi and the value —pi with probability I — Pi (so that the expectation of each 
Xi is zero), and define X = Xi + ■ ■ ■ + X„. Then for any positive number a, 

Pr[X>a]<exp(^ + ^) and Pr[X < -a] < exp (^) . 



Proof. These assertions are Theorems A. 11 and A. 13 of |[ASOCI|| . ■ 

Lemma 6.6. Let pi, . . . ,pn be real numbers in the range [0, 1], and set E = pi + ■ ■ ■ + pn- 

Define mutually independent random variables Yi, ... ,1^ such that Yi takes the value 1 with 
probability pi and the value with probability 1 — pi, and define Y = Yi + ■ ■ ■ + Yn (so that 
the expectation of Y equals E). Then Pr[y > E + a] < exp (^^) for any real number 
< a < E /?>, and Pr[y < E — a] < exp (^^) for any positive real number a. 

Proof. This follows immediately from Lemma |675| upon defining Xj = — pi for each i and 

3 2 

noting that E = pn and that < under the assumption 0<a<-E'/3. ■ 

We now give the probabilistic construction of large B*[g] (mod n) sets. 

Proposition 6.7. For every < e < 1, there is a sequence of ordered pairs {nj,gj) of 
positive integers such that '^^^^''^^^ > e and — ^ e'^. 
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Proof. Let n be an odd integer. We define a random subset S" of {1, . . . ,n} as follows: for 
every 1 < i < n, let Yi be 1 with probability e and with probability 1 — e with the Yi 
mutually independent, and let S := {i : Yi = 1}. We see that \S\ = X]r=i expectation 
E = en. Setting a = y/en log 4, Lemma O] gives 



Pr [l^l < en - a/ en log 4 ] < 



2 

Now for any integer k, define the random variable 

R,^ ■= #{1 <c,d<n: c + d = k (mod n), Yc = Yd = 1} 

c+d=k (mod n) 

SO that Rk is the number of representations of k (mod n) as the sum of two elements of S. 
Observe that R^ is the sum of n — 1 random variables taking the value 1 with probability 
e'^ and the value otherwise, plus one random variable (corresponding to c = d = 2~^k 
(mod n)) taking the value 1 with probability e and the value otherwise. Therefore the 
expectation of Rk is E = {n — l)e'^ + e. Setting a = ^/S{{n — l)e'^ + e) log2n, and noting 
that a < E/3 when n is sufficiently large in terms of e, Lemma |6.6| gives 



Pr [Rk > (n - l)e^ + e- ^3{{n - l)e^ + e) log2n] < ^ 

for each 1 < k < n. Therefore, there exists a B*[g] (mod n) set S C {l,...,n}, with 
g < {n — l)e'^ + e — ^/S{{n — l)e'^ + e) log2n < e'^n, such that 15*1 > en — y/en log 4 > en. 
This establishes the proposition. ■ 

Define At(£:) to be the supremum of those real numbers S such that every subset of T 
with measure e has a subset with measure S that is fixed by a refiection t ^ c — t. The 
function iS.fi^e) stands in relation to C{g, n) as A(£) stands to R{g, n). However, it turns out 
that At is much easier to understand: 

Corollary 6.8. Every subset ofT with measure e contains a symmetric subset with measure 
e^ , and this is best possible for every e. In particular, = £^ for all Q < e <1. 



Proof. In the proof of the trivial lower bound for A(£) (Lemma p.2| ), we saw that every 
subset of [0, 1] with measure e contains a symmetric subset with measure at least The 
proof is easily modified to show that every subset of T with measure e contains a symmetric 
subset with measure e"^. This shows that At(£:) > e^ for all e. On the other hand, the 
proof of Proposition O is also easily modified to show that At ( '-^'^^'"^ j < as is the proof 



of Lemma p.5| to show that A^ is continuous. Then, by virtue of Proposition |6.7| and the 
monotonicity of At, we have At(£:) < e'^. ■ 

6.4 Explicit Constructions of B*[g\ Sets 

If S" C [0, 100) is a B*[g\ (mod 200) set, then the modular sums are the same as the real 
sums, and so 5* is a B*[g] set as well. This observation is the fundamental idea behind using 
the method of Proposition to construct B* [g] sets. 



53 



Proposition 6.9. Let g, h, x, y be positive integers. Then 



R{gh, xy) > Righ, xy + l- ) > R{g, x)C{K y). 

Proof. Let M C [l,m] C [l,y] witness the value of C{h,y), and let S C [0,s) witness the 
value of R{g, s). Take x > 2s, and relatively prime to y, and note that S" is a B*[g] (mod x) 
set. By Proposition the set M + yS is a. B*[gh] (mod xy) set. But by taking x to be 
sufficiently large, we see that M + yS is actually a B*[gh] set. 

We now compute the smallest and largest element of M + yS. Clearly the smallest 
element is 1, and the largest is m + y{s — 1). Since M is a B*[h] (mod y) set, we may shift 
it modulo y so as to minimize m. M has C{h, y) elements, so there must be two consecutive 
elements whose difference (mod y) is at least [ ^^.^ , i.e., we may take m < y+1— [f^^f^] • 
Thus M + yS C[l,y + 1- \^] + y{s - 1)] = [l,ys + 1 - [^]], and |M + ys] = 
R{g,s)C{h,y). m 

The reader might feel that the part of the argument concerning the largest gap in M is 
more trouble than it is worth. We include this for two reasons. First, Erdos | |Guy94 , Problem 
C9] offered $500 for an answer to the question, "Is R{2,n) = ^/n + O (1)?" This question 
would be answered in the negative if one could show, for example, that the B* [2] (mod p^ — l) 
sets constructed by Bose (the k = 1 case of Proposition |6ri|(ii)) contain a gap which is not 
O (p), as seems likely from the experiments of Zhang ||Zha94|| and Lindstrom [[Lin98|| . Second, 
there is some literature (e.g., ||ESS95|] and ||Kuz96|| ) concerning the possible size of the largest 
gap in a maximal Sidon set contained in {!,..., ra}. In short, we include this argument 
because there is some reason to believe that this is a significant source of the error term in 
at least one case, and because there is some reason to believe that improvement is possible. 

Our plan is to employ the inequality of Proposition |6.9| when y is large, h = 2, and x ^ ^g. 
In other words, we need nontrivial lower bounds for C(2,n) for n ^ oo and for R{g,n) for 
values of n that are not much larger than g. The first need is filled by Proposition |6.1|, while 
the second need is filled by the following lemma. 

Lemma 6.10. For all g > 1 we have R{g, 3g - [g/3\ + 1) > g + 2 [g/3\ + [g/Q\ . 
Proof. One can verify that a witness is 



0, 



+ 2 



0, 



u 



9,9 + 



U (2g 



,^9 



We remark that this family of examples was motivated by the finite sequence S = 
{1, 0, |, 1, 0, 1, 1, 1}, which has the property that its autocorrelations are small relative to 
the sum of its entries. In other words, the ratio of the £°°-norm of S* * S* to the £^-norm 
of S itself is small. If we could find a finite sequence of rational numbers for which the 
corresponding ratio were smaller, we could convert it directly into a family of examples that 
would improve the lower bound for p{2g) in Theorem |1.3| for large g (see the proof of the 
theorem in Section |6.6| ). 

In addition to these parametric results, we have established by direct (exhaustive) compu- 
tation the exact value of R{g, n) for small values of g and n. Table ^ records, for given values 
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Table 2: min{n : R{g, n) > k} 



of g and k, the smallest possible value of max 5* among all B* [g] sets S consisting of exactly k 
positive integers; in other words, the entry corresponding to k and g is min{n: R{g, n) > k}. 
For example, the {k, g) = (8, 2) entry records the fact that there exists an 8-element Sidon 
set of integers from [1,35] but no 8-element Sidon set of integers from [1,34]. 

To show that R{2, 35) > 8, for instance, it is only necessary to observe that the witness 
{1,3,13,20,26,31,34,35} has 8 elements and is a B*[2] set. To show that i?(2, 35) < 8, 
however, seems to require an extensive search. 

6.5 Probabilistic Constructions of B*[g] Sets 

We can use the probabilistic methods employed in Section |6.3| to construct large B* [g] sets in 
Z. The proof is more complicated because it is to our advantage to endow different integers 
with different probabilities of belonging to our random set. Although all of the constants 
in the proof could be made explicit, we are content with inequalities having error terms 
involving big-0 notation. 

Proposition 6.11. Let j > n be a real number and n > 'j be an integer. There exists a 
B*[g] set 5 C {l,...,n}, where g = 7 + 0(^7 logn), with \S\ > 2^^ + 0(7 + (7^)1/^). 

Proof. Define mutually independent random variables Yk, taking only the values and 1, by 
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R{g,x) 



Witness 



R{g,x)/y/2gx 



~2~ 

VIS 

2 



3V10 
V3 

Jl 
V2IO 
17 

4\/30 

4 

3^3 

j|! 

5\/22 



2 7 4 {1,2,5,7} 

3 5 4 {1,2,3,5} 

4 31 12 {1,2,4,10,11,12,14,19,25,26,30,31} 

5 9 7 {1,2,3,4,5,7,9} 

6 20 12 {1,2,3,4,5,6,9,10,13,15,19,20} 

7 15 11 {1,2,3,7,8,9,10,11,12,13,15} 

8 30 17 {1,2,5,7,8,9,11,12,13,14,16,18,26,27,28,29,30} 

9 24 16 {1,2,3,4,5,6,7,8,9,13,14,15,17,22,23,24} 

10 33 20 {1,2,4,5,6,7,8,9,10,11,13,15,20,21,22,23,30,31,32,33} 

11 25 18 {1,2,3,4,5,11,12,13,14,15,16,17,18,19,20,21,23,25} 

Table 3: Important values of R{g,x) and witnesses 



0.756 
0.730 
0.756 
0.738 
0.775 
0.759 
0.776 
0.770 
0.778 
0.768 



l<k<\ (31) 
k > n. 

(Notice that pk < for all k > 1.) These random variables define a random subset 
5* = {fc : Yfc = 1} of the integers from 1 to n. We shall show that, with positive probability, 
is a large B*[g\ set with g not much bigger than 7. 
The expected size of S is 



2./^- ^ + 0(1). (32) 
If we set ao := A/2-E'olog3, then Lemma ^]6| tells us that 

Pr[|5|<£;o-ao]<exp {^) =\- 
Now for any integer A; G [7, 2n], let 

l<j<rt l<j<fc/2 

the number of representations of A; as A; = Si + S2 with Si,S2 G S'. (Here we adopt the 
convention that Yk/2 = Pk/2 = if A; is odd). Notice that in this latter sum, and 
the YjYk-j are mutually independent random variables taking only values and 1, with 



Pr{n = l}=pfc := 



nk ' 



0, 
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PT[YjYk-j = 1] = PjPk-j- Thus the expectation of Rk is 



Ek:=2 ^ PjPk~j+Pk/2 < 2 ^ 

l<j<k/2 l<j<k/2 



TT 



k/2 




7 



TT{k-j] 



+ 



7 



1 



/27 

t{k-t) \ nk 



nk/2 



7 + 



27 
7rk 



(33) 



using the inequahties pk < a/^ and k > •y. 

If we set a = a/3(7 + 1) log3n, then Lemma |6]^ tells us that 

^2, 



Pr[i?fc > 7 + 1 + a] < Pr[i?fc > Ek + a] < exp 



3-Ei. 



< exp 



3(7 +i: 



1 

3n 



for every k in the range '~f < k < 2n. Note that Rk ^1 trivially for k in the range 1 < < 7. 
Therefore, with probability at least 1 — | — (2?2 ~ 7)3^ = 3^ > 0, the set S has at least 
- ao = 2a/^ + 0(7 + (7Ti)^/^) elements and satisfies < 7 + 1 + a for all 1 < A; < 2n. 
Setting 5f:=7 + l + a = 7 + 0(a/7 logn), we conclude that any such set 5 is a B*[g\ set. 
This establishes the proposition. ■ 

Schinzel conjectured that among all pdfs supported on [0, |], the function 



/2x' ~" L"' 2J' 

0, otherwise 



has the property that ||/ * / ||oo is minimal. We have 

/ * f{x) - 



TT 
2 ' 



f - 2tan-i ^/2^ 
0, 



xe [0,i], 

otherwise 



and so ||/*/||oo = f - We have adapted this function in our definition ( pT]) of the probabilities 
Pk] the constant | appears as the value of the last integral in Eq. (^). If Schinzel's conjecture 
were false, then we could immediately incorporate any better function / into the proof of 
Proposition and improve the lower bound on IS*]. Indeed, Schinzel's conjecture is one of 
the motivations for our Conjecture |5.8| , which by the above discussion is logically stronger. 

Theorem |1.4| . For any S > 0, we have R{g,n) > (-^ — 5)y/gn if both and ^ 
sufficiently large in terms of 5. 



are 



Proof. In the proof of Proposition |6.11| , we saw that 7 < (7 and = 7 + 0(1/7 log this 
implies that 7 = 5' + g log n)=g{\ + 0{,J^)). Therefore the size of the constructed 
set S was at least 



2\/f + 0(7+(7^) 



This establishes the theorem. 



l/4x 



2n(l + o(^/^ 



0{g+{gnf") 



1 + 



log n 



+ 
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6.6 Lower Bounds on R{g^ n] 



We are now ready to prove Theorem |1.3| , which we restate here for the reader's convenience. 
Recall that p{g) = liminf„_»( Mmi) 



Theorem |1.3| . We have 



P(4) 


> 


2 

V7 


> 0.755, 


p(14) 


> 


11 


> 0.759, 


p(6) 


> 


2V2 
/15 


> 0.730, 


p(16) 


> 


17 
4v^ 


> 0.775, 


P(8) 


> 


2 

Vi 


> 0.755, 


p(18) 


> 


4 
3v^ 


> 0.769, 


p(10) 


> 


7 

3v^ 


> 0.737, 


p(20) 


> 


2v^ 


> 0.778, 


p(12) 


> 


V5 


> 0.774, 


p(22) 


> 


18 
5^22 


> 0.767, 



'6g' - 2g [j/3J + 2g 

11 



and for any g > 12, 



In particular, for any 5 > we have R{g, n) > — 6)y/gn if both g and ^ are sufficiently 
large in terms of 6. 

Proof. For any positive integers x and m < ^Jnjx, the monotonicity of R in the second 
variable gives R{2g,n) > R{2g,x{m? — 1)) > R{g,x)C{2^m? — 1) by Proposition If we 



choose m to be the largest prime not exceeding ^Jn/x (so that m > ^Jn/x by the Prime 



Number Theorem), then Proposition |6.9| gives R{2g, n) > R{g, x) ■ m > R{g, for any 

fixed positive integer g, and hence 

p(2,) = liminf^5^^>^^^'^) 



\f2gn \J2gx 

The problem now is to choose x so as to make ^^7^^ as large as we can for each g. For 
g = 2,3, ...,11, we use Table || to choose x = 7,5,31,9,20,15,30,24,33,25, respectively 
(see Table § for witnesses to the values claimed for R{g,x)). This yields the first group of 



assertions in Theorem |1.3| . For g > 12, we set x = 3g—[g/3\+l and appeal to Theorem |6.10 
giving the second assertion of Theorem |1.3| . 

We remark that the above proof gives the more refined result 



11 , / ^/ 1 /n\("-i)/2 

Ri2g, ^) > ^ \/2^ (l + Oi^g-'+i^- 



as ^ and g both go to infinity, where a < 1 is any number such that for sufficiently large 
y, there is always a prime between y — y°' and y. For instance, we can take a = 0.525 
by ||BHP01|| . This clarification implies the final assertion of the theorem for even g, and the 
obvious inequality R{2g + l,n) > R{2g,n) implies the final assertion for odd g as well. ■ 



Habsieger and Plagne |[HP|] have proven that i?(2,x)/v4x is maximized at x = 7. For 



g > 2, we have chosen x based solely on the computations reported in Table 0. For general 
g, it appears that R{g, x) / \/2gx is actually maximized at a fairly small value of x, suggesting 
that this construction suffers from "edge effects" and is not best possible. 
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7 Upper Bounds for A(£) 



7.1 Upper Bounds Derived from Constructions of B*[g\ Sets 

In Section 5 we used the connection between B*[g\ sets and measurable sets with small 
symmetric subsets to deduce upper bounds for R{g,n) from lower bounds for A{e). In this 
section we exploit this relationship in the opposite direction, converting the lower bounds 
on R{g, n) established in Section 6 into upper bounds for A{e). Our first proposition verifies 
the statement of Theorem |l.l| (i). 

Proposition 7.1. A{e) = 2e - 1 for ^ < e < 1, and A{e) > 2e - 1 for all < e < 1. 

Proof. Recall from Lemma p.5| that the function A satisfies the Lipschitz condition |A(a;) — 
A{y)\ < 2\x — y\. Therefore the inequality A{e) > 2e — 1 for all < £ < 1 follows easily 
from the trivial value A(l) = 1. To prove that A(e) = 2e — 1 for y| < £^ < 1, then, it suffices 
to prove that A{e) < 2e — 1 in that range; and again by the Lipschitz condition, it suffices 
to prove simply that A (y|) < 



For any positive integer g, we combine Proposition ^.l] and Lemma |6.1CI| and the mono- 
tonicity of A to see that 

9 > ^( Rig,3g~[g/3\+l) \ ^ J g + 2[g/3\ + [g/6\ \ 

3g-[g/3\+l- \ 3g-[g/3\+l J- \ 3g-[g/3]+l J' 

Since A is continuous by Lemma |3]^, we may take the limit of both sides as (jf ^ oo to 
obtain A (y|) < | as desired. ■ 

Proposition 7.2. The function is increasing on (0, 1]. 
Proof. Choose < e < ^o- By Proposition B^, we have 



R{g,x) C{h,y) ^ R{gh,xy) 
X y ~ xy 

With the monotonicity of A{e) and Proposition this gives 



^ / R{g,x)C{h,y) \ ^ ^ r R{gh,xy) \ ^ gh 
\ X y ) ~ \ xy ) ~ xy 



Let gi^Xi be such that — + Eq and — — > Afeo), which is possible by Proposition ^ 
By Proposition |6?7| , we may choose sequences of integers hj and yj such that > ^ 

and — < (^)^ &s j — > oo. This implies 



R{3^C{h^^^ and ^^<A(.o)f-V, 



X yj ~ Xi yj ~ v^o^ ' 

so that, again using the monotonicity and continuity of A, 



e 



2 



^ A f Ri9i,Xi) C{hj,yj) 



^0 ■^iyj V yj 

A{e) ^ A(eo) 



as j ^ OO. This shows that — < as desired. 
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We can immediately deduce two nice consequences of this proposition. 
Corollary 7.3. lim£^o+ exists. 

Proof. This follows from the fact that the function is increasing and bounded below by 
I on (0, 1] by the trivial lower bound (Lemma J72). ■ 

Corollary 7.4. A{e) < forO<e <^. 

Proof. This follows from the value A(y|) = | calculated in Proposition 7A. and the fact that 
the function is increasing. ■ 

The corollary above proves part (iv) of Theorem p..l| , leaving only part (v) yet to be 
established. The following proposition finishes the proof of Theorem |1 . 1|. 



Proposition 7.5. ^ < 1— ^ for all < e < 1 



Proof. Define a := 1 — — e, so that 2a — = e. If we set 7 = na'^n in the proof of 
Proposition |6.11| , then the sets constructed are B*[g] sets with g = na'^n + 0{^/n\ogn) and 
have size at least 



— — + 0(1 + ao) = (2a - a')n + 0((7n)^/^) = en + 0{^) 

TT TT 

from Eq. (^). 

Therefore, for these values of g and n, 

^ n ^ n ' 

as n goes to infinity, by the continuity of A. On the other hand, we see by Proposition ^TT| 
that 



2^/;R(5S^x ^ g '^9 ^ 'Ka^n^-0[^Jn logn) 



„2 



.0( 



TTQ; ^/ /logn \ TT . , TT 

' ' ' O 1 



(2a-a2)2 VV e^n ) {2 -of ' ' (1 + ^/f^)^ 
as n goes to infinity. Combining these two inequalities yields < ^^^^^^^ desired. 



7.2 Upper Bounds Derived from Finite Unions of Intervals 

Another way to approach bounding A(e) is to compute precisely Ak{e), the supremum 
of those real numbers 5 such that every subset of [0, 1) with measure e that is the union 
of k intervals has a symmetric subset with measure 6. From the definition it is easy to 
see that Ak{e) is a decreasing function of k. In fact, it follows directly from the proof of 
Proposition |5]^ that A{e) = inf^ Ak{e) = limfc^oo Afc(£:). We also have the following formula. 
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Lemma 7.6. Let E = IJi=i('^«) A) ^ [0, 1) be the union of k disjoint intervals. The largest 
measure of a symmetric subset of E is 

{k k 
y y max {0, min{c — aj, /?j — c} — max{c — ctj — c 
i=l j=l 

Proof. This follows immediately from the fact that, for any real numbers < a, < /3j < 1 and 
c, the measure of the set {x : c — x e (ai, c + x E (0:2, P2)} is max {O, min{c — ai, P2 — c} 
— max{c — 0^2 — c}}. ■ 

Since the maximum over c is achieved for some c that is the midpoint of endpoints of 
the intervals (i.e., c = (aj + /3j)/2, c = {ai + aj) /2, or c = + /?j)/2), this theorem provides 
an effective method for the computation of D{E) for any particular union of k intervals. 
Indeed, this formula reduces finding a particular value Afc(e) to a finite number of linear 
programming problems, though in practice this computation becomes unmanageably large 
even for small values of k. In principal, the entire function could be calculated by solving 
these linear programming problems with the constant e remaining unspecified, branching 
finitely many times depending on various simple inequalities for e. 

The case A; = 2 is simple enough to deal with directly; we state the result in Proposi- 
tion |7.7| but omit the proof. We have shown computationally that the graphs of A3(e) and 
A4(e) lie on or below the polygonal paths described in Conjecture |7]^ (see Figure ||), but we 
have not verified that these upper bounds are in fact sharp. 

Proposition 7.7. The graph of the function (£:,A2(£)) is the polygonal path connecting 
(0,0), (3/4,1/2), and (1,1). 

Conjecture 7.8. The graph of the function ^zi^) is the polygonal path connecting (0,0), 
(|, |), {ji^Yi), (f?f); o^nd (1,1). The graph of the function A4(e) is the polygonal path 
connecting (0,0), (^ |), (^, ^), (if), (fj), (g, ^), and {1,1). 

It seems likely that the graph of the function Aj.{e) always contains the line segment 
connecting (0,0) and (^^, f ), where n is the least integer for which R{2,n) = k + 1. It is 
easy to show that for every k > 2, the graph of the function Ak{e) contains the line segment 
connecting the points (|, |) and (1, 1). 
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0.6- 



. 4 



0.2 - 




Figure 8: The graph of A2{e) and the conjectured graphs of A^i^e) and A4(£:) 

8 Some Remaining Questions 

We group the problems in this section into three categories, although some problems do not 
fit clearly into any of the categories and others fit into more than one. (We also refer the 



reader to the Conjectures and already propounded.) 



8.1 Properties of the Function A(e) 

The first open problem on the list must of course be the exact determination of A(e) for all 
values < e < 1. In the course of our investigations, we have come to believe the following 
assertion. 

Conjecture 8.1. A{e) = max{2£ - 1, fe^} y^r allO <e <l. 



Notice that the upper bounds given in Theorem |1.1| are not too far from this conjecture, 
the difference between the constants ^ = 0.7934 and | = 0.7854 in the middle range 
for e being the only discrepancy. In fact, we believe it might be possible to prove that the 
expression in Conjecture |8.1| is indeed an upper bound for A{e) by a more refined application 
of the probabilistic method employed in Section |6.5| . The key would be to show that the 
various events > 7 + 1 + a are more or less independent of one another (as it stands we 
have to assume the worst — that they are all mutually exclusive — in obtaining our bound for 
the probability of obtaining a "bad" set), so that we could omit the logSra factor from the 
chosen value of a. 

There are some intermediate qualitative results about the function A{e) that might be 
easier to resolve. It seems likely that A{e) is convex, for example, but we have not been able 
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to prove this. A first step towards clarifying tlie nature of A(e) might be to prove that 

\A{x)-A{y)\ 



\x - y\ 



<^ max{x, y}. 



Also, we would not be surprised to see accomplished an exact computation of A(|), but we 
have been unable to make this computation ourselves. 

We do not believe that there is always a set with measure e whose largest symmetric 
subset has measure A.{e). In fact, we do not believe that there is a set with measure 
£0 := infje:: A{e) = 2e — 1} whose largest symmetric subset has measure A{eo), but we 
do not even know the value of Eq. In Proposition |7.1| , we showed that Eq < but this 
was found by rather limited computations and is unlikely to be sharp. The quantity in 
Theorem |1.3| is of the form --p=tt=, and similarly the quantity in Theorem |l.l] (iv) is of 
the form Thus any improvement in the bound Eq < would immediately result in 



improvements to Theorem L3 and Theorem |l.l| (iv). We remark that Conjecture implies 
that £0 = 2+^4— K ~ 0.6834, which in turn would allow us to replace the constant in 

Theorem |1.3| by \/ ^. 



8.2 Artifacts of our Proof 

Let IK be the class of functions K G L^(T) satisfying K{x) > 1 on [— |, |]. How small can 
we make ||i^||p for 1 < p < 27 We are especially interested in p = |, but a solution for any 
p may be enlightening. 

To give some perspective to this problem, note that a trivial upper bound for inf/^gK 
can be found by taking K to be identically equal to 1, which yields \\K\\p = 1. One can 
find functions that improve upon this trivial choice; for example, the function K defined in 
Eq. is an example where ||i^||4/3 = 0.96585. On the other hand, since the £^-norm of 
a sequence is a decreasing function of p, Parseval's identity immediately gives us the lower 
bound lli^llp > lli^lla = \\K\\2 > ( /^^/^ 1^ rft)^^^ = 71 = 0.707107, and of course this is the 
exact minimum for p = 2. 

We remark that Proposition |4.2| and the function b{x) defined after the proof of Corol- 



lary ^]3| provide a stronger lower bound for 1 < j9 < |. By direct computation we have 



1.14939 > ||6 * 6II2, and by Proposition |]2| we have \\b * b\\l > ||-^||4/3 for any K G 



1 4/3 



Together these imply that \\K\\p > ||-ft'||4/3 > 0.96579. In particular, for p = | we know 
the value of infj^eK 1 1 -^11 4/3 to within one part in ten thousand. The problem of determining 
the actual infimum for 1 < p < 2 seems quite mysterious. We remark that Green ||Gre01 



considered the discrete version of a similar optimization problem, namely the minimization 
of ll^llp over all pdfs K supported on [—^, ^]. 



As mentioned at the end of Section |4.2| , we used the inequality \\g\\2 < llfi'lloolls'lli which 
is exact when g takes on one non-zero value, i.e., when g is an nif. We apply this inequality 
when g = f*f with / supported on an interval of length |, which usually looks very different 
from an nif. In this circumstance, the inequality does not seem to be best possible, although 
the corresponding inequality in the exponential sums approach of ||CR'1'|| and in the discrete 
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Fourier approach of [ Gre01 | clearly is best possible. Specifically, we ask for a lower bound 
on 

l|/*/l|oo||/*/l|l 

/To 

supp(/)C[-i,i] 

that is strictly greater than 1. 

8.3 The Analogous Problem for Other Sets 

More generally, for any subset E of an abelian group endowed with a measure, we can define 
Ae(£) := inf{D{A) : A (1 E, X{A) = e}, where D{A) is defined in the same way as in Eq. (|^). 
For example, A[o,i](e) is the function A{6) we have been considering throughout this paper, 
and Aj{e) was considered in Section |67^ . 

We believe that for each E (IM., there is a positive constant c such that A{e) > Ae{s) > 
ce^ for < e < 1. The work of Abbott ||Abb9CI|| seems relevant. If we are concerned only 



with e ^ 0, and we normalize by considering only sets E for which \{E) = 1, then it may 
be possible to take an absolute constant. In other words, is it true that 

inf lim inf — -rr-^ > ? 

\{E)=1 

Most of the work in this paper generalizes easily from E = [0, 1] to = [0, 1]'^. We have 
had difficulties, however, in finding good kernel functions in higher dimensions. That is, we 
need functions K{x) such that 

is as small as possible, while K{x) > 1 if all components of x are less than ^ in absolute 
value. This restricts K on one-half of the space in 1 dimension, one-quarter of the space in 2 
dimensions, and only 2~'^ of the space in d dimensions. For this reason one might expect that 
better kernels exist in higher dimensions, but the computational difficulties have prevented 
us from finding them. 
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